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Abstract: In this paper we probe the typical states of the superstar ensemble of Q using 
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1. Introduction 

The thermodynamical nature of black holes was revealed by studying the response of black 
holes to both perturbations in their defining parameters (mass, angular momentum, charge) 
H , as well as their response to the presence of quantum fields in the bulk of their geometry 
|3|, [|] . This unexpected nature of black holes led to the famous black hole puzzles 1 : The 
black hole entropy puzzle, and the information loss puzzle. A lot of effort was spent in the 
the last four decades or so to solve these puzzles. Among the ideas that emerged during this 
investigation is the fuzzball proposal, first advanced by Mathur and collaborators [||, ||, 0. 
By now there are many nice reviews on the subject, see for example Jj], || |j| [R], |TT], |l2|, |i~3|| . 
The idea of the fuzzball proposal is that the black hole geometry is an effective description 
of an underlying exponentially large system of microstates. It is widely believed that 
some of these microstates manifest themselves as smooth geometries on the gravity side. 
However, depending on the kind of black holes under consideration, not all of them can 
have a gravity description [14]. Another point of view that was advanced in [15], is that 



the black hole geometry is the effective description of a set of microstates that do not have 
a smooth geometry description. The situation is far from being conclusive and we will 
not be concerned with these issues in the present paper. For a further discussion on these 



issues see e.g. []16|, |17|| . 



One of the papers that went beyond comparing the macroscopic and microscopic en- 



tropies in checking the fuzzball proposal is Jl|] (see also [18] for a different approach). They 
studied a specific ensemble of heavy half-BPS states of type-IIB string theory on asymp- 
totic AdSsX S 5 , called the superstar ensemble, and constructed its effective dual geometry 
(see [|l9| for the application of similar ideas to the case of the D1-D5 system). It turned 
out that this geometry is the same as the one of the superstar of pC|| . This led them to 
conjecture that the superstar is an effective description of the superstar ensemble in line 
with the fuzzball proposal. They supplemented this claim with further checks using some 
correlation functions. We initiate in this paper a further check of this proposal by studying 
the effect of light half-BPS probes on the heavy states of the superstar ensemble. We find 
that at leading order, the final answer is universal and does not depend on the details of 
the typical states of the superstar ensemble. 

This paper is organized as follows. In the second section we quickly review the superstar 
ensemble and discuss some important properties of its typical states that will be useful later 



1 Another puzzle, the singularity puzzle, is usually added to the list. This puzzle questions the validity 
of general relativity near the singularity and has nothing to do with the thermodynamical nature of black 
holes. That is why we did not include it in our list. 
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on. This material is not new and can be found in Q. After that, we introduce the different 
classes of probes that we will be dealing with in this paper and discuss some of their 
properties. In the third section, we introduce the two point function which is the main 
quantity we will be evaluating in this paper. Essentially, half-BPS states of type-IIB string 
theory on asymptotic AdSs x S 5 can be described using Young diagrams (YDs) which are 
treated as irreducible representations of the unitary group 2 U(iV), where N is the flux of 
the background geometry [23, 24 1. To evaluate the two point function we end up facing the 
problem of decomposing the tensor product of two U(./V) representations into irreducible 
ones. This is the topic of the fourth section. Although it is impossible to completely carry 
out this decomposition, we manage to extract enough information about it to be able to 
evaluate the leading order of the two point function. In the fifth section, we calculate the 
leading order term of the log of the two point function for the different classes of probes. 
We close the paper by discussing these results and pointing out some further directions of 
research. We left some details to the appendices. Among them let us mention appendix-^ 
since it includes a summary of our notations and YD terminology that is heavily used in 
this paper. We advise the reader to read it before reading the main part of the paper 
(sections |I| and ||) . 



2. Backgrounds and probes 

Our central aim in this paper is to probe a class of "heavy" half-BPS states of type-IIB 
string theory on asymptotically AdSs x S 5 spacetimes, using "light" half-BPS states of the 
same theory. By heavy we mean states with energy /conformal weight that scales as N 2 , 
whereas by light we mean states with energy/conformal weight that scales slower than A'" 2 . 
Since on the gravity side, the heavy half-BPS states backreact on spacetime and generate 
the bulk geometry, an LLM geometry p5fl , we will call them the background states. The 
light half-BPS states on the other hand probe these geometries, hence we name them 
probes. 

Our probe analysis will take place entirely in the dual conformal field theory, the M = 4 
SU(A) super Yang-Mills theory |2q ]. As is well known the half-BPS states of this theory 
can be described using YDs with at most N rows (2^, |24|], where the total number of its 
boxes is the conformal weight of the corresponding state. As a result, we will heavily use 
the YDs technology in this paper. For the needed notions, properties, as well as conventions 
used in this paper we refer the reader to appendix-^. 

In this section, we will review the background states that we are interested in. We 
will discuss some of their main properties that will be crucial later on. After that, we will 
discuss the type of probes we will be using and some of their most important properties 
which will play a prominent role in sections || and || 

2 Whether the actual group is SU(iV) or U(iV) is still a matter of debate. We expect that our leading 
large N result will not be modified if we use SU(iV) group instead. See pl| , |22[ for details on the use of the 
SU(iV) group and modifications to be brought to the U(JV) formulas. 
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2.1 The superstar ensemble 

In the following we review the superstar ensemble discussed in [jl|, section 3.3] in some 
detail. Our main interest is the main structure of the typical YDs of this ensemble. Hence, 



we will neither discuss the mapping between the LLM geometries [25| and the YDs of 
this ensemble, nor its effective description in terms of the superstar of [2C]. We refer 
the interested reader to the paper Q. We will start by describing the ensemble and its 
"average" YD. Then, we will discuss some general properties of typical states that will be 
of interest to us in the bulk of the paper. We will be following closely jLj]. 

2.1.1 The ensemble and its limiting shape YD 

The superstar ensemble is the set of YDs with fixed number of columns D, fixed number 
of rows N and fixed number of boxes A, such that: 

D~N, A = ^ND~N 2 , (2.1) 

which are weighed equally. Following ||], we will denote by rj the length of row i, and by 
Cj the number of columns of length j 3 . In our conventions (see appendix-^), we have the 
relations: 

cn = r N , Ci=ri- n+i ; 1 < i < N - 1 . (2.2) 

We can describe the superstar ensemble using a canonical ensemble. The associated parti- 
tion function is given by: 

oo N 

Z- V e -"E» £<L,c j = n * , (2.3, 

C 1 ,C 2 ,...,C N = 1 j = l 

where q = , p = e~ x , (3 and A are some positive parameters that will be fixed later on. 
Since we are dealing with the canonical ensemble instead of the microcanonical one, we 
need to fix the average of the number of boxes A as well as the average of the number of 
columns D such that: 

N N . j 

A = (Y,JC,) = q d q lo g Z = J2^ 1 f-j = ^ND, (2.4) 



N N 

V 



D = {Y J C 3 )=pd p \ogZ = Y J T^— • (2.5) 
i=i 3=1 



We restrict ourselves to spelling out the results here leaving the details to appendix-]^. We 
find that by fixing /3 and p as: 

1 - q D D 1 

p = T^^~dT1^ p ~n> (2 ' 6) 



3 Notice that we have a different convention for numbering the YD rows than the one used in E). We 
start the numbering from top to bottom, whereas they number the way around, from bottom to top. This 
is the reason we have different expressions for and Ci than them. 
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we satisfy the constraints above. The scaling of logq with N is fixed by evaluating the 
entropy of this ensemble and using that logg = (dS/dA). 

When dealing with very large YDs that come with certain probability /weight, it is 
usually beneficial to construct the limit shape YD. This YD can be thought of as the 
"average" YD as it is constructed by finding a relation between the average length of a 
row and its position. In our present case, if we denote by y{x) the length of a row whose 
position is given by its number x, the limit shape YD turns out to be a triangle with the 
diagonal given by the equation: 

y(x) = D (l - Jl) . (2.7) 

According to the usual intuition from statistical mechanics, most of the YDs in the super- 
star ensemble will be close to this limit shape YD. How close are they will be the subject 
of the next subsection. 

2.1.2 Typical states 

In the following we will summarize some of the main properties of typical YDs in the 
superstar ensemble that will be of importance to us. More precisely we want to know the 
deviation between a random typical YD O and the limit shape YD Oq. Since the number 
of boxes, columns and rows is held fixed in the superstar ensemble, the only thing that can 
happen is for boxes near the diagonal of Oq to move from one row to another. 

Let us first worry about boxes in the same row. A good estimate for the number of 
moved boxes is given by <j(D). The reason being that for the limit shape YD there is a 
linear relation between D and the length of a row i, Oi = D (1 — i/N). For a more precise 
treatment of a(Oi) see jjj. We have: 

N n 

AD) = £ ((cf) - (a) 2 ) = (pd p ) 2 XogZ « - (D + N) ~ N . (2.8) 
i=l 

Hence the fluctuation in the length of a row is of order yN. This is the usual thermal 
fluctuation since the length of almost all of the rows of the limit shape YD is of order N. 

The other quantity that will be crucial to us is the total number of migrating boxes 
i.e. the total number of boxes that are moved around when comparing a typical YD in the 
superstar ensemble with its limit shape YD. A good estimate of that is the fluctuation in 
A. We have: 

a 2 (A) = (qd q ) 2 \ogZ ^ -D{D + N) ^ N 2 . (2.9) 

Hence, the fluctuation in A is of order N. Once again we find the usual thermal fluctuation. 

Let us summarize what we found here. If we pick a random typical YD O from the 
superstar ensemble, the length of a row in O and the same row in the limit shape YD differ 
at worst by order yN boxes. If we add up the absolute values of these differences 4 we get 
a number that is at worst of order N. 

4 Since the number of total boxes is held fixed in the superstar ensemble, these differences should add up 
to zero. 
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2.2 The probes and their SU(iV) and Sh dimensions 

After we discussed the most important properties of our background YDs, we turn our 
attention to the probe YDs. Their main property is that the total number of their boxes h 
is much less than A 2 i.e. h -C N 2 . Another property of these YDs is that the total number 
of their rows n equals N at most. This is because we will be treating them as irreducible 
representations of U(iV) when evaluating their two point function in our backgrounds, see 



section 3.1 for more details. 

Among all possible probe YDs, we will limit our discussion in this paper to the homoge- 
neous YDs. These are YDs where the ratio of the numbers of rows (columns) whose length 
does not scale with N in the same way as the number of columns d (respectively rows n) 
to the total number of rows (respectively columns) tends to zero in the limit N — > oo. If d 
denotes the total number of columns, n the total number of rows then we have: 

h~nd. (2.10) 

For the other kinds of probes, we should think of them as the result of a tensor product 
of two or more homogeneous probes. In a sense, the homogeneous YDs are our building 
blocks that generate all the other YDs by the means of taking the tensor product between 
them. Form now on, whenever we talk about a YD we mean a homogeneous one. 

To fix the notation once and for all, ij; will stand for a probe YD, the length of its i th 
row, d the total number of its columns, n the total number of its rows, tpo = Max{d, n}, 
and h the total number of its boxes. A collection of conventions to be used throughout 
this paper is collected in appendix-^. 

In the remaining of this section we will study the leading behavior of dim^vV' t ne di- 
mension of a probe YD ip as an irreducible representation of SU(A) as well as dim/j ijj its 
dimension as an irreducible representation of the permutation group Sh- These quantities 
will play an important role in this paper as they are intimately connected to the decompo- 
sition of the tensor product O <S> ip (see section |] for more details). The latter will play a 
role in the evaluation of the two point function ( |3.lD whose leading term we are after. Let 
us first start by giving two different formulas for the dimension of an SU(A) irreducible 
representation specified by a YD ip (see for example |27], P8fl). The first one is in terms of 
the difference in the lengths of different rows and reads: 

N-l N-k / i i \ 

dim^=n n( i +^ iz r !j± ) • ( 2 - n ) 
k=i i=i ^ ' 

The second formula is in terms of the hook lengths (see appendix-^). It reads: 

dim N if> = J — , (2.12) 

where H.^ = \\ i • fyjj), is the hook length associated to the box (J, j). Although we 

will heavily use the first expression, we will still need the second expression since it has a 
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similar form as the dimension of tp as a representation of the permutation group Sh- The 
latter reads (see for example [27, pSfl): 

dim h ^=^-. (2.13) 

Since we are working with very large quantities, it is much useful to evaluate the log of 
these dimensions given that we are interested in the large N limit. Since the number of 
rows of ij) is given by n < N, we find using equation ( p. 11 ): 



n— 1 n—i , I I \ n N—i , , \ 

io g dim^V' = EE lo g( 1 + ^ !±1 )+E £ 1 og( 1 + -)- ( 2 - 14 ) 

It is clear that the leading term of the expression above will depend on how scales with 
N. Such behavior leads us to classify the probes into three classes which are the following: 

• Generic probes class: In this case both n and d are very small compared to N i.e. 
n, d <C N . The reason we call them generic is that they exist for all regimes of h of 
interest to us (h < N 2 ). 

• Linear probes class: In this case either n or d but not both scales as N . The reason 
we coin them the name linear is that the leading behavior of the log of their SU(iV) 
dimension is linear in h as we will see below. This class of probes is associated to the 
following regime of h: N < h -C N 2 . 

• Long probes class: In this case d^> N. Notice that this class of probes exists only 
in the regime /i> N. 

When evaluating the leading order of the log of the SU(iV) and dimensions of ip in the 
following, we will discuss each class of probes on its own. For reasons that will be clear 
later on (see section |j), the leading behavior of dim/j^ will be of interest to us only if 
h N. We will keep the discussion general and leave the treatment of a concrete example 
to appendix-^. But before continuing with the discussion of the dimensions of different 
probe classes, let us pause for a moment and discuss a curious duality of YDs 5 that will 
be useful below. 

2.2.1 An approximate duality of Young diagrams 

We know that two YDs that are related by the flip row column have the same Sh 
dimension. This is easily understood from equation ( 2.13] ) since the number of boxes as 



well as the hook length remain the same under such a flip. We will show in this section 
that the leading term of log dimjy'0 exhibits such an invariance for 6 d < N. 



5 This looks like a manifestation of the hole/particle symmetry |3^] but not quite. This is because 
dim at if} does not have a clear physical interpretation. However through its connection with the decompo- 
sition of the tensor product O ® ip (see section ^) it has a connection with the aforementioned symmetry. 

6 This is because n < N as a result of the YD being an U(JV) representation. 
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We will assume in the following that d <C n. This assumption is not restrictive since 
we know that such YDs are one end of this duality. The only non-covered case is when 
d ~ n, but as we will see later on, this case fits nicely in the manipulations used for d <C n. 
A property that will be used below, which will be argued for in subsections 2.2.2) and 



2.2.3| below and further checked in a concrete example in appendix-^, is that the leading 



behavior of log dimjvV' is either of order h log N or at worst of order h for the cases of 
interest to us. We will take these properties as granted for now. 

Our starting point is the relation (|2.12j ) in the case d <C n. First, we rewrite it as: 

N-l ipi N-l ipi 

log diniTv ip = l °S( N ~ ^ + i + J) " Yl £ lo S Hhi) ■ ( 2 - 15 ) 

i=l j=l i=l j=l 

The duality in this formula is a statement about the exchange i •<-)■ j. The part that 
depends only on the hook length is trivially invariant under such a change. The quantity 
(i + j) is also invariant. The only problematic part is ipi. However its contribution is much 
smaller than h and hence subleading given our claim above. To prove this claim, we use 
that ipi <C N to expand the log term and get: 

N-l A , N-l /2 

6 log dim^v v « y y -rr^— < y < « ^ > 

^ ^ N + i + j ^ N + i 

i=i j=i J i=i 

where we used that h ~ nd. Notice that these manipulations do not work if d 3> n. But 
this is not a problem since this regime will be at the other end of the duality for the case 
n > d. 

As already advanced in the beginning, the same manipulations lead to the same con- 
clusion in the case n ~ d. The reason being that in this case both n and d <C N since 
we are interested in cases where h <C N 2 . Hence, these kind of YDs belong to the generic 
class. Although the correction that we get form the terms depending on ipi will be of order 
h, the leading behavior of log dim^r ip is of order h logiV (see the subsection below, and 
also appendix-^)). Hence the duality survives in this case as well. 

2.2.2 The generic probes class 

In this case we have n <C N and ipi <C N. In turns out that we need to distinguish between 
two cases: nCd, and d <C n. The case d ~ n is a trivial consequence of the previous two 
cases. The easiest case to deal with is when we have n <C d. Let us discuss it first, then 
turn to the second case d <C n. In the case n <C d (or n 2 <C h), we have from equation 
(2.14) the following upper bound: 



n N-i 

log diniTv tp < y y [log (j + ipi)~ log j] 

i=l j=l 
n 

^ y i( N - i + 1>i) l °g( N + (N-i) log(iV -i)-ipi log ipi] 
i=i 

<y[iPi logN + iPi-iPi logiPi] log i^-j , (2.16) 
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where in the first line we used that {ipi — tpi+j) < ipi f° r 1 < i < n. To move form the first 
to the second line we used equation ( |B.6| ) to evaluate the sum over j, and to get the third 
line we used that ipi <C N and n <C N. The last result is a consequence of: 

log A-h log d = J2^i lo M^) ( 2 - 17 ) 

i=l i ^ ' i=l 

where we used the fact that the ratio (ipi/d) is independent of N for almost all of the rows. 
For the lower bound of ( |2.14 ), we get: 

n N-i 

log dim at i>>^2 ^2 t log ^ lo &j] 

i=l j=n+l—i 

n 

> h log iV + h - [(n - i + ipi) log(n - i + 1 + ipi) - (n - i) log(n - i + 1)] , 



i=i 



where in the first line we use that the first sum in ( 2.14| ) is a positive number. To get the 
last line, we used the same steps as above. We need to deal with the last sum. We find 
using that ipi ~ d 3> n the following approximate value: 

n n 

} J [(n-i + ipj) log(n - i + 1 + ipj) - (n - 1) log(rt - % + 1)] } j ipj log ipj » h log c? , 

i=l i=l 

which we discussed before. Plugging this result in the expression for log dim^r ip above, we 
find: 

log dim^ ip > h log I — 



Combining this lower limit with the upper limit in ( 2.16| ), we conclude that the leading 
behavior of log dimAr ip in the case N ^> d S> n is given by: 

logdimArV-Zilog^^ . (2.18) 

What about the other case u>(i? For this we take advantage of the duality discussed 
in section- 2. 2.1 to swap the rows and columns of ip, which brings us to the previous case 
where n here plays the role of d there and vice versa. Hence, in the case n> d, the leading 
behavior of log dimwip is given by: 

(N 

log dim^v ip ~ h log — 
\ n 

As a conclusion, we find that the leading behavior of log dimjv^ in the large N limit in 
the case where n and d <C N is given by: 

log dimAr ip ~ h log (J£j , (2.19) 

where ipo = Max{d , n}. 
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The next quantity we want to find is the leading term of log dim/j ip in the case h <C N . 



The only non-trivial term in the dimension relation (2.13) is H^. Its leading behavior can 



be derived using the expression (2.12) together with the leading behavior (2.19). First, we 
need to deal with the numerator of (2.12), we have: 



n ipi 

log numjv tp = log(N + j — i) 

i=i j=i 

n 

« [{N - i + A) log(iV - i + ^) - {N - i) \og{N 

-i=i 



log(JV-t) « h logiV , 



(2.20) 



i=i 



where we use the approximation (|B.q ) to get the second line, then we used that ipi <C N 
and n -C to approximate the sums in the second and third line respectively. Next, we 
plug the leading behavior of log dim^r ip given in equation ( p. 19 ) and the leading behavior 
of the numerator of ( p. 12 ) that is given in ( [2.20 ) above, in the equation ( |2.12 ), to get the 
following leading behavior of the product over the hook lengths: 



log U^^h log tpo , 



where ipo = Max{d, n}. As a result, the expression (2.13) for the dimension of ip as a 
representation of Sh leads to the following leading behavior: 



log dim/j ijj ss h log 



h 

V'o 



(2.21) 



Notice that in the case where h ~ ipo, the leading behavior of log dim^^ will be propor- 
tional to h. We will mostly use the expression above for the leading behavior of the Sh 
dimension of this class of probes, keeping in mind the special case h ~ ipQ. 

This is the only class of probes where we will need the leading behavior of their 
dimension as a representation of Sh- In the remaining of this section we will look for the 
leading behavior of the SU(A^) dimension of YD in the other two classes. Although a 
universal exact expression for the leading term is not always possible, we will derive its 
leading behavior in the worst situations. 



2.2.3 The linear probes class 

In this class of probes, we have either d ~ A^ or n ~ N. Following the same route as above, 
we first discuss the case d ~ N, then use the duality discussed in section- |2.2.l| for the case 
n ~ AT. The reason we can use this duality will be clear below. 

Our starting point is once again the relation ( 2.1 4j ) . We have in the case d ~ A^ 3> n 
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the following upper bound: 



n N-i 

log diniAT t/j < ^ ^ [ lo g(i + V'i) - log j] 

i=l j=l 



< ^ [(N - i + ^) log(iV + ^) - (iV - i) log N — tpi log Vi] 

i=l 
n 

< £ [(TV + fr) log(l +&) - log^] ~ h , 



1=1 



where we used the approximation ( |B,6| ) to evaluate the sum over j in the first line and 
that n <C A" to get the second line. In the last line we introduced the quantity ip i = ipi/N ', 
which is independent of N. At the end we used that ipi ~ d ~ N and h ~ nd ~ nAT. The 
reason we did not try to get an exact result here is that knowing that the upper bound is 
of order h is more than enough for our purposes, see section | for more details. 

To complete the circle of thoughts, although not needed, let us look for a lower bound 
on the leading behavior of log dini/v ip- We have: 

n N—i 

log diniAr i>>^2 ^2 [ log ( J + ~ lo S j\ 

1=1 j=n+l—i 
n 

>^2[(N-i + V;) log(AT + Vi) -(N-i) log N] 
i=i 

n 

~ t( n ~ i + ^») ^ogV'i - (n - i) log(n - i)] 

i=l 

n 

> ^ [(AT + ^) log(l + - ^ log V^] ~ h , 



i=i 



Where we used the same steps as in the derivation above. Notice that we get the same 
expression as for the upper bound. Hence we conclude that in the case d ~ N, we have 
the following leading behavior of log dimjv tp- 

n 

log dimjv i> ~ ^2 [(N + Vi) log(l + - V>i log&] ~ h , (2.22) 

i=l 

which is of order h as argued above. 

This was for the case d ~ N , what about the other possibility n ~ AT. Once again we 
take advantage of the duality discussed in section- 2.2.1 to safely conclude that the leading 
behavior of the dimension in this case (n ~ N), is given by: 

n 

log dimTv ip « ^2 i(N + rii) log(l + fii) - n» log n*] ~ /t , (2.23) 

i=l 

where rii stands for the length of the columns of ip and fii = rii/N. 
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All in all, we conclude that the leading behavior of the dimension of the linear probes 
is such that: 

log dim N ip ~ h , (2.24) 
where the actual value of the non-zero coefficient that multiplies h is not important to us. 
2.2.4 The long probes class 

The shape of these probes suggest that their dimension will be subleading with respect 
to the previous two cases. This is because in ( p. 12 ) will be of the same order as the 
numerator. Let us proceed and check this intuitive guess. As usual we are going to look for 
an upper and a lower bounds of log dim^v ip using equation fl2.14Q . For the upper bound, 
we find: 

n N-i 

log diniTv ip < ^2 X t log ( J + ~ log ^ 
i=i j=x 
n 

~ X) K N ~ i + l °g( N + log^i -(N-i) \og(N - %)] 

i=l 

n / d\ 

<Y,i N lo S^ - iV log iV] « niV log ( - , 
i=i ^ ' 



where we used as usual (|B.6|) to evaluate the sum over j to get the second line. To get the 
third line we used that ipi S> N S> n. To get the last result we used that: 



n n . 

n log d — } log if)i=) log — ~ n 



since the ratio (ipi/N) is N- independent for most ipi. 

What about the lower bound? Using the same manipulations as above, we easily find: 

n N-i 

log dirmv ip > ^2 X [ lo §(^ + 3) ~ 1o SJ] 

i=l j=n+l—i 
n 

~ J2 ^ N ~ 1 + ^) h %( N -i + i>i)-{n-i + ^i) Iog(n - % + &)] 
i=i 

n 

- ^2 i( N - 1 °S( N (n-i) log(n - i)] 

i=i 

n / d \ 

> J2 [N log ~N logiV] «niV log f -J . 
i=i ^ ' 

Combining this result with the upper bound above, we conclude that the leading behavior 
of the SU(iV) dimension of these probes is given by: 

log dim^ V> « n N log (A) . (2.25) 
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Notice that by exchanging the roles of N and d, one can map this leading term to the 
corresponding one in the case of generic probes given in ( 2.19| ), taking into account that 
h ~ nd. 



3. Probes in AdS 5 x S 5 

The states we are dealing with in this paper are half-BPS states of type-IIB string theory 
on asymptotic AdSsX S 5 . These states can be described on the dual field theory side 
using YDs seen as irreducible representation of U(iV), where N is the number of fluxes in 



the background geometry [23, 20]. We are mainly interested in probing the backgrounds 
associated to the microstates of the superstar of J2(J according to the proposal advanced in 
Q . Remember that these microstates are characterized by YDs whose number of columns 
N c , number of rows N r and number of boxes A are all held fixed as follows: 



N r = N , N C = D~N , A = ^NN C , 

see subsection |2.1| and for more details. In this section we will spell out the probing tool 
that we will be using in this paper in its full generality. After that, we will warm up with 
a toy model that describes a background that does not belong to the superstar ensemble 
for reasons that will be clear later on. 

3.1 The two point function 

The game we will be playing in this paper is as follows. We have a background that is 
generated by a typical state O of the superstar ensemble and we add on top of that a light 



probe if), then see what happens. According to [23], modulo an overall trivial space time 
dependence which is completely fixed by conformal symmetry, everything boils down to 
evaluating the two point function: 

W fl> = «°° ( y» , (3., 

where ip is our probe, O is our background, and the vacuum two point function (a (3) is 
given by: 

{a/3) = S aP ]J(N-i + j) , (3.2) 

i,j 

where 5 a p is a schematic notation that means that the two YDs a and /3 should be identical, 
and the product is over all the boxes of the YD a where i is the column number and j 
is the row number. The expression fl3.1| ) is evaluated as follows. We first decompose the 
tensor product O ® ift into irreducible representations of U(iV): 

O ® y, = d k <fk , (3.3) 

k 

where dk is the degeneracy of the YD tp^, and k is a summation index that will not play 
any important role in the following. Next, we use that the two-point {O O') is bilinear to 
find: 
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where we used that the two point function (|3,2| ) is diagonal and introduced the notation: 



<• * )o = WW) < (3 - 5) 

so that we do not get a cluttered expression. We will be using this simplified notation from 
now on. So our task can be summarized into the following steps: 

1. Get the needed information from the tensor product decomposition (|3,3|). These 
include the degeneracy dk, the type of YDs ip^, and their total number. From now 
on, we will refer to the tensor product of two irreducible representations of SU(iV) 
by the tensor product between the associated YDs, in an abuse of language. 



2. Evaluate the term {tpk <pk)o m the expression (3.4). 



3. Finally collect all the intermediate results to get the final answer. 

We will deal with each step on its own in the following sections. Some of the details will 
be left to appendices that we will refer to at the right places. But before doing so, let us 
discuss a simple toy model. The results obtained here will serve as a good reference point 
later on. 



3.2 A toy model: A background outside the superstar ensemble 

The decomposition of the tensor product O ® ip is very involved in general (see subsection 
|4.1.1| ), but there are some simple situations where things become straightforward. One of 
the simplest cases correspond to a background YD Q where all of its N rows have the same 
length: 

Qi = kN , 

where k is an arbitrary constant. Although this background Q is not of immediate im- 
portance to us since it is not part of the superstar ensemble, it is both a good warm up 
exercise as well as a good reference point for the two point function ([O]) of the superstar 
ensemble. 

The simplicity of the background Q resides in the fact that there is only one YD tpo 
when decomposing the tensor product Q<g> ip. If we denote by 6i the number of boxes added 
to the z th row of Q to form the YD tpo then we have: 

5i = ipi . 

The two point function ( p.lj ) simplifies drastically in this case as we need to only evaluate 
(ifo v?o)q- The latter can be easily evaluated using its defining equation ( |3.5| ) together with 
the explicit expression of the vacuum two point function (a j3) given by ( |3.2| ) . Using theses 
equations we find that: 

N S t 

log (ipo ifo) q = £ lQ g( N + Qi + 3-i)i (3-6) 
i=i j=i 
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where Qi stands for the length of row i of the YD Q. We can easily evaluate the sum over 
j using equation (B.6) to get the following approximate expression: 

N 



log {(p k (p k ) Q &-h+ Y,( N + Qi + $i-i) log(iV + Qi + 5i-i) 

i=l 

N 

Y^(N + Qi - i) log(N + Qi - i) , (3.7) 



»=i 



where we used Yli=i $i = ^- Let us now specify the formula to our case where Qi = kN 
and 5i = ipi nonzero only for i < n. We get: 



log (ip <Po)q ~ - h + Y^K 1 + K ) N + fa ~ *1 MC 1 + K ) N + 
i=i 

n 

- J^[(l + K)iV - i] log[(l + - i] . 

1=1 

To proceed further, we need to distinguish between different probe classes. 

The generic probes class In this class we have both <C N and n <C N. We find 
after expand the log: 

log((^ tp)) Q = log (990 (fo)a~h logiV . (3.8) 

The linear probes class In this class we have either d ~ N or n ~ N . In the case 
n ~ iV, we have <i <C iV and hence we can use the same manipulations as before to get: 

log((^»e«>il°g#. (3-9) 

In the opposite case ipi ~ A r , we have n <C iV so we can expand the log term around i = 0. 
We get in this case the same result as above. 

The long probes class In this class we have d S> N and n <C N. Expanding the log 
term appropriately, we get: 

log((^»S~Mogd. (3.10) 

Notice this result does not depend explicitly on N. Let us now turn to the more interesting 
case where the background O is a typical state in the superstar ensemble. 



4. Dealing with the tensor product O <g> ip 

Our starting point in evaluating the two point function ( p.l| ) is to construct the tensor 
product decomposition ( |3.3| ). For our purposes, we do not need to get all the details of this 
decomposition which is hopeless. We only need to get an estimate of the degeneracies dk of 
the YDs ipk, an estimate of the total number of these YDs dt = ^2 k dk, and a rough idea on 
the shape of the YDs <pk so that we can evaluate the quantities (ipk <Pk)o- I n the following 
we will start by reviewing the construction of the YDs that appear in the decomposition 
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of the tensor product of two YDs of XJ(N). By studying carefully the conditions on these 
YDs that appear in such decomposition, we propose a map between semi-standard Young 
tableaux (SSYTx) and these YDs. Next, using this map we discuss the shape of "dominant" 
YDs (fk and their degeneracies dk according to the scaling of h with N. Some of the details 
will be left to appendices [E] and [F]. 

4.1 The tensor product decomposition and the semi-standard Young tableaux 

The first question to answer is how to construct the YD (p^ starting from the YD O and 
ip when the tensor product O ® tp is seen as a tensor product between two irreducible 
representation of U(iV)? Fortunately there is a well known recipe to construct the decom- 
position of the tensor product of two YDs. We will first review this recipe, then we will 
repackage the information about such decomposition into labelings of tp in the case of our 
interest. 

4.1.1 Decomposing the tensor product of two representations 

Let A and B be two YDs associated to two irreducible representations of U(iV). The 
YDs corresponding to the irreducible representations appearing in the decomposition of 
the tensor product A B are constructed by adding all the boxes of one of the two YDs, 
say B, to the other YD, here A, in all possible ways subject to the following rules (see for 
example Chapter.9]): 

• First, we fill each box of the YD B with a label a,, where i stands for the number of 
the row the box belongs to. 

• We start by adding the left most box 7 form the first row of B, which carries the label 
ai, to the YD A in all possible position such that we end up a YD of U(iV) i.e. the 
length of rows is decreasing from top to bottom, and the length of each column is 
at most N. We repeat the same process with the remaining boxes in the first row 
of B starting from left to right, keeping in mind that each new added box should be 
to the right or below the previous one and that no two boxes among the added ones 
are in the same column. The latter two conditions are solely for boxes that belong 
to the same row in B. 

• We repeat exactly the same process for each row of the YD B until we finish all of 
its boxes. 

• Finally, we keep only the YDs that satisfy the following rule. Let C denote one of 
the resultant YDs. We start our journey at the upper right box of the YD C going 
from the right to the left of the first row, then move down to the second row and, 
once again, start from its right most box and move to the left, and so on and so forth 
until we reach the lower left box of C. At each box in this journey, the number of the 

7 This is a matter of choice as boxes belonging to the same row are treated on equal footing, hence the 
reason they carry the same label. In most literatures one starts from the right most box, but we choose to 
start from the left most one for reasons to be clear later on. 
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newly added boxes with label ctj should not exceed the number of boxes with label 
dj if j < i. 

At the end we collect the resulting YDs according to their shape. Since the decomposition 
of the tensor product will play a crucial role in this paper, it is a good idea to name the rules 
in the construction above, instead of trying to explain them each time. Essentially, we have 
three important rules which are as follows. The YD rule will refer to the condition that 
the resultant digram should be a YD at each step of adding a new box. The antisymmetry 
rule will refer to the condition that two boxes originating from the same row of the YD B 
should not belong to the same column in the resulting YDs. The ordering rule will refer 
to the combination of the order in which we add boxes originating from the same row in 
the YD B (explained in the first step above) which will be called the row rule from now 
on, and the last condition above which will be called the column rule from now on. 

Before continuing, let us apply these rules in the case of the following tensor product: 
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(4.1) 



In this example we used letters to label the boxes, which is fine since we are still following 
the rules. Notice that in this example we have a single YD that appears twice with different 
order of the letters a, b, hence it is doubly degenerate. Notice also that the box with the 
letter b never appears in the first row. This is a straightforward consequence of the ordering 
rule. In general, boxes from row i in B cannot be attached to the row j in A if j < i. 

There is another immediate consequence of the ordering rule which has to do with the 
number of added boxes to a row i in A. Let us denote by n\ the number of boxes from row 
j in B that are added to row i in A. Suppose that i is the row we are interested in, and 
suppose also that the added boxes to this row belong to the rows m < k < n from B. We 
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have the following conditions: 

i-l 

i k i < i™)' 1 ~ n j ) 5 Vfc > m + 1 < k < n . 



n, 

i=i 



Summing these relations gives us: 



i-l 



k=m j=l 

where b m is the length of row m in 5. Hence, if the boxes added to row i of the YD A used 
to be in rows of the YD B with m denoting the number of the upper row among them, 
then the number of added boxes to the row i is smaller or equal the number of boxes in the 
row m of the YD B. These two properties will have a nice interpretation after introducing 
our map between the YD ip^ and SSYTx ip. See appendix-^ for more details. 

Our next target after we learned how to decompose a tensor product into irreducible 
representations (YDs) is to count the number of the resulting YDs and their degenera- 
cies. This will be the subject of the next few sections. We will first start by encoding the 
information about YDs that appear in the tensor product decomposition of O ® ip into 
labelings of ip. Then, using this new way of characterizing the tensor product decomposi- 
tion, we proceed to deal with our main target, estimating the number of YD ipk and their 
degeneracies cZ&. 

4.1.2 Repackaging the information about the tensor product decomposition 

Our aim in this paper is to study the two point function (3.1) as we vary the probe ip. 



Hence, it will be very effective and fruitful to use the YD ip in packaging the information 
about the tensor product 0(8)ip. It is clear from the way we construct the decomposition of 
the tensor product of two YDs that it is much easier in the present case O ® ip to distribute 
the boxes of ip on O. A naive guess for packaging the information about the YDs that 
will appear in the decomposition of O ® ip, is to associate to each box of ip the number of 
the row of the YD O it is attached to. As a result, we get for each ipf. a certain labeling 
of ip. To get more information about the type of these labelings, we need to answer the 
following question: What are the implications of the tensor product decomposition rules 
that we discussed previously on the labelings of ip? 

The easiest condition to apply on the possible labelings of ip is that the number of 



rows of ifk should not exceed N. Since the YD O has already N rows, see section 2.1 
the labels of ip should be among the numbers £ = {1,2,..., A^}. For a specific ipk the 
collection of the labels of the associated labeling of ip will in general be a subset of C. This 
observation will play an important role later on. What about the impact of the three rules: 
the YD rule, the antisymmetry rule, and the ordering rule? It is clear that the last rule, 
the ordering rule, is universal in the sense that its outcome does not depend on the YD O 
in contrast to the other two rules whose outcomes depend highly on which background O 
we are studying. Hence, it is a good idea to look first for the implication of the ordering 
rule on the possible labelings of ip, then look for possible corrections due to the other two 
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rules. In the remaining of this subsection, we will only discuss the impact of the ordering 
rule on the labelings of ip leaving the inclusion of the other two rules mainly to appendix-^. 

The ordering rule as defined previously (section [4.1.1] ) has two parts to it. The first 
part, called the row rule, has to do with boxes that belong to the same row of ip whereas 
the second one, called the column rule, has to do with boxes that belong to different rows of 
ip. It is easy to see that the row rule is equivalent to the requirement that in the associated 
labeling of ip, the numbers in the same row should be weakly increasing from left to right. 
What about the column rule? To understand its consequence, let us assume for a moment 
that ip is a YD with a single column. To simplify the discussion below we will also number 
the boxes of ip according to their row i.e. box i means a box in row i of ip. It is easy to 
see that in this case, the column rule is equivalent to the condition that the box i should 
not be added to the rows of O that are before the one that has box j added to it if j < i. 
Hence, the acceptable labelings of ip in this case should be such that the numbers labeling 
boxes of the single column are strictly increasing from top to bottom. What about the 
case where ip has two columns? Let us concentrate on the four boxes belonging to rows 
i and j where j > i. The column rule implies that the row to which the box j from the 
first column of ip is attached to, should be below the row to which the box i from the first 
column of ip is attached to. The same condition should be satisfied by the boxes i and j 
from the second column of ip. Hence, in this case the column rule requires that the number 
labeling boxes in the same column of ip to be strictly increasing from top to bottom for an 
acceptable labeling of ip. It is not hard to see that this is the manifestation of the column 
rule in the acceptable labelings of a generic ip. This can be easily seen by iterating the 
previous discussion for each row of ip. 

In our previous example of the tensor product decomposition Q4.1] ), the labelings of 
the resultant YDs are: 
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(4.3) 



which clearly satisfies the last two conditions: numbers in the same row are weakly increas- 
ing from left to right, and numbers in the same column are strictly increasing from top to 
bottom. The two YTx between bracket correspond to the degenerate YD. 

To summarize, the shape of O, the fact that we are dealing with the U(iV) group 
together with the ordering rule implies that the acceptable labelings of ip that are associated 
to a YD ifk in the decomposition of O (g) ip should be such that: 

• The labels belong the set of numbers £ = {1,2,..., N}. 

• The numbers labeling boxes in the same row should be weakly increasing from left 
to right. 



• The numbers labeling boxes in the same column should be strictly increasing from 
top to bottom. 
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These last two conditions define the so called SSYT, see appendix-^]. If we take into account 
the first condition, the number of these SSYTx tjj is the dimension of ip as a representation 
of SU(iV), see for example [27|. Since we are not taking into account neither the YD rule 
nor the antisymmetry rule, dim^r ip is an upper bound on the total number of the YDs (p^ 
i.e. d t = Yjk d k< dmiAr -0. 

Parameterizing the decomposition of the tensor product O ® ip in terms of SSYTx ip 
gave us an idea about the total number of the YDs but what we are really after is 
the value of the degeneracies and the range of the index k. Let us pick a YD ip^. This 
YD is fixed once we know the iV-tuple (3 = {n\ , n<i , ... , njy), where ni stands for the 
added boxes to row i of the YD O. Its degeneracy d^ on the other hand corresponds to 
the total number of the possible different row origins 8 of theses added boxes in the YD 
ip. In our interpretation of the tensor product decomposition as associating fillings to the 
YD ip to get a SSYT, the rij in the iV-tuple /? counts the number of times the number 
i appears in the SSYT. As a result the iV-tuple /3 is precisely a filling, see appendix-^. 
So, the degeneracy dk is bounded from above 9 by the number of different SSYTx ip that 
are associated to the same filling ft Such a number is called a Kostka number and is 
denoted by K-tp,p, see appendix-^. Actually there is a precise relationship between Kostka 
numbers and degeneracies of YD appearing in the decomposition of a tensor product, see 
for example |^2| . However, we will not need this exact relation in what we are trying to 
do in this paper. 

Before continuing let us clarify a point that will play an important role in extracting 
the leading behavior of dk in the following. We concluded that we can repackage the 
information about the tensor product O <S> ip in terms of SSYTx where the labels take 
values in £ = {1,2,..., N}. We mentioned above that the number of SSYTx ip with 
a certain filling j3 is given by the Kostka numbers K^p a, however this is not completely 
exact. Essentially the Kostka number a encodes only the information about ip and j3 
being partitions of the same integer h, but not the information about the range of labels, 
see for example [|33[j . Remember that f3 is a collection of positive integers ft, some of them 
can be zero, which are ordered according to the index i. We know on the other hand that 
a partition of an integer is a collection of strictly positive ordered integer that sum to that 
integer. So we seem to have a mismatch between the number of SSYT of interest to us 
and Kostka numbers: we can have zero entries in the filling j3 as well as ordering that 
depends on the index of the entry ft and not its value. We will discuss the issue of the 
zeros here and leave the ordering issue to appendix-^. We will also discuss some further 
useful properties of Kostka numbers in the same appendix-]^. 

Remember that in order for a YT to be SSYT, the labeling should have a certain 
ordering along the rows and the columns of the associated YD (see appendix-^). This 
means that if (3 has some zero entries, the number of SSYTx is the same as the one for 
a filling f3 which is constructed from f3 by omitting the zero terms, then relabeling its 
remaining entries ft keeping the order of their index untouched. Take as an example 
g = (ft , 0, 0, At, AO, then g = (ft , , AO such that ft = ft, ft = ft and ft = ft. 
8 Remember that boxes in the same row are treated symmetrically. 

9 Since we are not taking into account the modifications due to the YD and antisymmetry rules. 
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The reason that the number of SSYTx for both fillings is the same is that we did not alter 
the ordering of the index i, so the conditions on the order along the rows and column of 
the YT remain intact. Notice that has the usual interpretation that it counts how many 
times the index i appears in the labeling of the YD. 

4.2 The tensor product O ® ip: I-The case h < N 

We are ready to tackle our main question of estimating the maximal possible degeneracy 
of a YD in the decomposition of O <g> tp through the corresponding Kostka number. Our 
idea is to use dim^r tp since it is related to the number of possible SSYTx and hence to the 
Kostka numbers. Remember that dim^y ip is the number of all possible SSYTx with labels 
among the set {1 , 2 , ... , N}. This number has two contributions: The first one is the 
number of possible choices of labels, and the second one is the associated Kostka number 
K aj p. Using that K a> /3 = if \(3\ < n, where n is the number of rows of ip and \(3\ is 
the number of nonzero entries of the filling (3 (see appendix-^), it is easy to see that the 
dimension of ip as a representation of SU(iV) written in terms of Kostka numbers reads: 

dim N 4>= Yl C N K i>,P, ( 4 - 4 ) 

P; n<|/3|<Min{Af ,h} 

where n is the number of rows of ip, \(3\ stands for the number of nonzero elements in the 
filling /3, h is the number of boxes of ip, C™ is the usual binomial coefficient: 

rim n - 



m\ (n — m)\ ' 

and K^ t p is the Kostka number associated to the filling (3. The expression ( [4.4| ) above 
reflects the fact that when labeling the YD ip, we first need to choose the labels to use, 
among all the possibilities C = {1 , 2 , ... , N} which gives rise to the binomial coefficient, 
then we need to decide on the possible multiplicities of these labels (i.e. choose a filling) 
each of which gives rise to its associated Kostka numbers. 

Notice that not all fillings (3 are allowed in the sum (fO|) above as the corresponding 
Kostka number might vanish. Our approach will be to include all of these fillings f3 in the 
sum keeping in mind that some of the associated Kostka numbers can vanish. How many 
(3's are there? For a fixed \f3\, the number of possible fillings 7 such that | — y | = is the 
same as the number of partitions of the integer h into |/3| strictly positive integers 7^ that 
are ordered according to their index i and not their value. This is similar to the problem 
discussed in section- of appendix-^. The difference resides in the fact that we want to 



count partitions that do not have zero entries. It is easy to adapt the counting there to 
this case by putting from the start one ball in each box, reducing the number we want to 
partition to h — \f3\. Hence we get the number: 

= C^- 1 . (4.5) 

It is clear that this is an over-estimate for the actual number of fillings 7, in the sense that 
not all the associated Kostka numbers are different from zero, but this will be enough for 
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our arguments in this and the next subsection. Using the estimate above, the total number 
of fillings P is bound from above by: 



h 

101=1 

It is an upper bound since we allowed for fillings /3 with \/3\ < n in this sum for which 
we know that their associated Kostka number vanishes. We have also allowed the range 
of summation to run all the way to h even if h > N. Notice that we are still missing the 
contribution of the binomial coefficient cL/3| to get the actual number of fillings, however 
we will not do so here and keep calling Mr above the total number of fillings in an abuse 
of language. 



As is clear from the form of the expression (4.4), our search for the "dominant" SSYTx 
ip will depend on whether h or N is bigger. We will concentrate on the case h <C N here 
and leave the cases h ~ N and h S> N to the next subsection. Before continuing with 
our discussion, let us be more precise. First of all, what we really mean by comparing h 
and N is comparing the leading behavior of h with N. Hence, h = N — yN is part of the 
cases h ~ N . The next point we need to clarify is what class of probes we are interested 
in. Remember that we classified our probes into three families: generic, linear, and long 



probes, see subsection 2.2. Since we are interested in the leading behavior of the two point 



function ((tp tp))oj given by equation (3.1), and as it is clear from the discussion in section 
|5] and the leading behavior of dini/v tp for the different families (section ^^), as well as the 
fact that dimTv if) constitutes an upper bound on the total number of the YDs that appear 
in the tensor product decomposition of O ® ift, we will be only interested in the generic 
probes class. Remember that the YDs of this class are the ones with the number of rows 
n as well as the number of columns d are much smaller than N i.e. n, d *C N. 

The idea in the case h <C N is to maximize both the binomial coefficient c]^ and the 
Kostka number Kj, a independently. Notice that the sum in this case is over the range 
n < \P\ < h. Hence, the maximum of the binomial coefficient c3 is reached for |/3| = h. 
This is satisfied only for the filling (3q = (1) 10 , which gives rise to standard Young tableaux 
(SYTx). The reason we have only one filling /3 = (1) for \/3\ = h is that the number of 
fillings in a family with fixed |/3| is the number of -tuples (n\ , n-i , . . . , n^) such that: 

|J8| 

V i ; rii > , m = h . 

i=i 

It is clear from this condition that the only solution in the case |/3| = h is rtj = 1 , V i, and 
hence the claim that /3q = (1) is the only filling such that \/3\ = h. What about the Kostka 
number «? A moment thought reveals that this number is also maximized by the filling 
/?o = (1)- This is because in this filling all the labels used to label the boxes of the YD ip 
are different from each other. Combining this observation with the fact that for SSYTx, 
the labels along the same column should be strictly increasing from top to bottom leads us 



3 The notation /3o = (1) means that all the entries in the filling /3o equal 1. 
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the aforementioned claim. As a result, the filling with the maximum contribution to the 



some in (4.4) is /3q = (1). Notice that maximum here does not mean that the contribution 
coming from the filling (3q = (1) dominates over the other contributions to the sum (|4.4|), 
It will be nice to find out if there is a condition on the shape of the YD ip such that this is 
true 11 . What is the value of the Kostka number K^^^? Remember that this is the same 
as the number of SYTx associated to the YD ip, which in turn is the dimension of the YD 
ip when seen as an irreducible representation of the permutation group Sh which is given 
by (see equation ( 2.13| )): 

K a (1) = dim h ip = — . 



As a side note, one can get this formula by rescaling N as (A N) in both the formulas (| 
and ( |2.12| ) of dim^r ip without touching the YD ip, then picking out the leading contribution 
to both of them in the limit A — > oo and equating them. This scaling argument may be 
helpful in answering the question raised previously about the condition of ip so that the 
dominant contribution to ( |4.4[ ) comes form SYTx. 

Let us continue our investigation of the SYTx associated to ip in the present situation 
i.e. the labels of the fillings of ip are subsets of C = {1 , 2 , . . . N}. Their total number 
taking into account the contribution form the different choices of labels is given by Nsyt = 
Mab x Nperm, where: 

log Mub = log « h log ^ , log M per = log dim h ip w h log ^- , (4.7) 

h ip 

where Ni a b is the number of possible labelings and M per m the number of SYTx ip given 
fixed labels. Notice that: 



log dimjv ip « log Mob + log M 



perm 



This is a very interesting observation as it suggests that we can restrict ourselves to the 
SYTx ip when studying the SSYTx ip if we are only interested in leading order quantities. 



For example, the contribution from the other fillings to (iA) is bounded from above by: 

< 5dim iV ^~2 /l - 1 ^^ i{1) , 

which corrects the log of the contribution of the SYTx to log dim at ip by subleading terms 
as one expects based on the observation above. 

Although we managed to get ample information about the SSYTx ip in the case h ^ N, 



we still lack the information we need: dk the degeneracy of the YDs ipk in (3.3) and their 
type. Using the discussion of the Kostka numbers above as a guide, and the map between 
SSYTx ip and one expects that one can also take as representatives of the YDs (fk 
appearing in the decomposition of O ® ip the ones that we get by adding at most one box 
to each row of O. These are the "duals" of the SYTx ip according to our map between 
SSYTx and YDs (fk, and we will denote them from now on by ip®. The reasons that we 



1 I am very grateful to Robert de Mello Koch for pointing out a mistake in an argument related to this 
point in a previous version of this paper. 
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can do so can be summarized as follows. First of all, the leading terms of log of their 
degeneracy and their number A/"o which are given by: 

log d° k n h log — , log Wo « h log ^ , (4.8) 

do no change once we take into account the the YD and the antisymmetry rules. See section 
F.l of appendix-|] for more details. Secondly, the leading term of the log of (ipk fk)oi that 



appears in the explicit form (3.4) of the two point function ( |3.1D , is the same for almost 



all the YDs ifk, see section 5.1 for more details. Lastly, the degeneracies d^ are maximized 
for this kind of YDs. This is because dk are related to Kostka numbers and these YDs are 
associated to the filling f3 = (1) which maximizes the Kostka number. We will come back 
to these issues and others when we discuss the full two point function of these probes in 
subsection |5.1.l| . 

4.3 The tensor product O ® ip: II-The cases h ~ N and h -C N 

As previously mentioned, when studying the decomposition of the tensor product O ® tp, 
we are solely interested in the generic probes. Let us first look for a lower bound for the 



maximum possible value of K a i p using equation (4.4). Since it is the maximum, it is bigger 
or equal to the average value of K^, p in this equation. To calculate this average we need 
to know the total number of terms in (4.4). Using the estimate (4.5) an over-estimate of 
the total number of terms is given by: 



N 

Mot =J2 C N C h-l < ^ 
k=0 



The combination of this estimate together with equation ( [4.4|) , the fact that log dimjvV' 



h log ./V (see subsection 2.2), gives the following approximate value for K™ ax the maximum 



value for the Kostka numbers: 

log K$ ax ps log dim V ~ h log N . (4.9) 

This means that there are some SSYTx -0* whose Kostka number is big enough to give 
rise to the leading term in dim^v W- What are the fillings of such SSYTx ip*l According to 
our discussion in appendix-[|| the fillings /3* are such that all the labels 1,2, ... , N are 
present with almost equal frequency. 

Based on this results, one expects that there are few 12 YD tp* k in the decomposition 
( |3.3| ) whose degeneracy d^ has the leading behavior: 

N 

log dk ~ log dinijv ip « h log — . (4.10) 

Vo 

Once again, we need to check the effect of taking into account the YD and the antisymmetry 
rules. We will discuss the inclusion of the YD rule here as it is relatively easier to discuss 



12 Few in the sense that the leading behavior of the log of their number is subleading with respect to 
h log TV. 
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than the inclusion of the other rule, the antisymmetry rule, which will be dealt with in 
section [F,2| of appendix-^. 

The fillings of these special SSYTx tp* include all the numbers from 1 to N. So, the 
potential associated YD (pt will be obtained by adding boxes to all the rows of O. The YD 
rule than implies that the length of the new rows should be decreasing from top to bottom. 
The invariance of the Kostka numbers t p under the reshuffling of the entries of the 
filling /3 comes to our rescue, see appendix-^. Using this invariance, we choose the ordered 
filling j3*: f3\ > /3 2 . . . > /3jy to correspond to one of theses possible ip k . At this point one 
might be tempted to declare victory and conclude that taking into account the YD rule 
does not change the results obtained through the Kostka number means. However, one 
should remember that the YD rule is not limited to the final (pk, but it is enforced after 
the addition of each box to the YD O. Although we could not come up with a satisfactory 
argument to why the final conclusion will not change even after fully taking into account 
the YD rule, it does not seems to be that crazy to conjecture that this is the case. A piece 
of evidence has to do with the entries ft* of the filling f3*. Remember that for maximum 
Kostka numbers in our case of interest we have (see appendix-]!] for details): 

Vi€{l, 2, ...,#} ; 

Using that for our probes h <C N 2 , we conclude that j3* <C N. This implies that it is more 
probable for boxes to end up in different rows, than in the same row. So, one can think of 
the cases where the YD rule is violated as organized constructions in contrast to the cases 
where it is satisfied, which can be thought of as random constructions. As a result, one 
can conclude that the number of cases where the YD rule is violated is subleading with 
respect to the total number of possible cpt. This conclusion is in conformation with the 



assumption that the leading term of log ((ip tp))o is continuous, see subsection 5T for more 
details. 



The inclusion of the antisymmetry rule is more involved and is left to section- F. 2 of 
appendix-[F] as already advertised to. The final conclusion is that in the decomposition of 
the tensor product O ® ip, for ip a generic probe and h ~ N or h S> N, there are special 
YDs ipt whose degeneracy d* k reproduces the leading behavior of dim^v ip, which is given in 



(|4.10|) . Restricting ourselves to one of these YDs will give us the leading order of the log 



of the two point function (pTfl) that we are after. For more details see subsection 5.1.2 



5. The full two point function and its universality 

After we managed to gather the information we need on the decomposition of the tensor 
product O ® tp, we are almost ready to answer our main question of this paper: evaluate 
the leading term of the two point function ([O]) whose explicit expression is given by (|3.-j|). 
The only remaining step is to evaluate the quantity (ip^ (fk)o- Our aim in this section is 
to evaluate the latter quantity, then get the leading term of the log of the full two point 
function (|3.1|) . The discussion will depend on the type of the probe as well as the scaling 
of h with N. We will start by discussing the involved case of the generic probes, then move 
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on to the simplest cases of the linear and long probes. But before that, let us first deal 
with the general expression of (ip k <pk)o- 

To deal with the quantity {(p k y>k)o-> we need to introduce some notations that we 
will be using throughout this section. We will also remind ourselves of old conventions for 
convenience. Oi is the length of the row i of the background YD O. For almost all the 
values of i, Oi ~ N. Remember that i runs from 1 to N. d is the number of columns of 
if), n is the number of its rows, h is the total number of its boxes, tpo = Max{n , d}. 6^ 

denotes the number of added boxes to row i of O to form the YD <p k , and rig stands for 

(k) 

the total number of non-zero <5 4 . The ordering rule implies that: 

n s >n, VA;,Vi; <*f } < d . (5.1) 

These inequalities are also a straightforward implications of the properties of the SSYT 
labeling in conformation with the equivalence between the ordering rule and the SSYTx 
tp. These inequalities will play an important role in the following. 
Our starting point is equation ( |3.7p adapted to the present case: 

N 

log (<p k y k ) ^-h + Y,(N + O t + 6^ - i) log(JV + O t + bf - i) 

N 

-J2(N + Oi-i) ]ag(N + Oi-i) , (5.2) 

i=l 

To perform the sum above, we need to specify which probe and regime we are in. This will 
be one of our tasks in the remaining of this section. 

5.1 The generic probes 

Remember that for this class of probes we have d <C N. This implies, according to 05.11), 

(k) 

that 5f J < N. Using that for almost all i's Oi ~ N, we can expand the log term in 
equation (|5.2|) and perform the sum to get: 

N 

log (<p k <p k ) *-h + J2 4 k) log(N + Oi - i) « h logiV . (5.3) 
i=l 

A legitimate objection to the use of this result in the expression Q3.4| ) is that the rows where 
Oi <C N might spoil the final result. An in depth discussion of this point depends on the 
type of dominant YDs ip^, which in turn depends on whether h <C N, or (h ~ , fe> N). 
Since the leading term of the degeneracy d k depends also on this splitting in the regimes 
of h, we will specialize below to these two distinct cases for the evaluation of the full two 
point function ( |3.lD . 

5.1.1 The h < N case 

We concluded at the end of subsection that in this case a good set of representative YDs 
ip® are the ones resulting from adding at most one box to each row of O. The number of 
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these YDs A/"o as well as their degeneracy have the leading terms that are given in equation 
Q4.S| ), reproduced here for convenience: 

log (f k « h log A , logA/'o ps ft log ^ . (5.4) 
^0 " 



We need to settle the question of the usage of the expression ( |5.3| ) for all <Pk)o i n 
the expression ( |3.4| ). The issue as pointed out earlier is that the behavior Oj ~ N used 
in deriving this expression is not valid always. Although the ratio of the number of such 
rows in O to the total number of rows N tends to zero in the large N limit, which is the 
limit we are interested in, one still should check that things do not go astray. First of all, 
notice that we only need to worry about these rows in the case where the number of boxes 
added to these rows is a finite fraction of the total number of boxes ft. This is because we 
are interested in leading order terms. From the properties of the background YDs O, see 



section 2.1 , one easily infers that these problematic rows sit at the tail of the YD O. Let 
us denote their number by m, m <C N but can be as big as we want. The probability that 
a box from ip ends up in one of these rows is p = (m/N). Let us suppose that <j) is a YD 
among the dominant ip^ YDs where («ft) boxes from ip are added to some rows among 
these m rows. Then the probability of finding such a YD among all the available YDs ipk 
is bounded from above by: 

V < C Kh p Kh ( 1 _ p )(i-«)fc_ ) . 0j for jy^oo, 

as one expects. This is an upper limit since we did not take into account that we still have 
a further contribution coming from the ratio between the total number of possibilities to 
distribute ft boxes on N rows and the number of possibilities to distribute (k h) boxes on m 
rows and the remaining boxes on (N — m) rows. This contribution reduces the probability 
further, but its effect is small, hence can be neglected. 

As a result, when evaluating the contribution of the YDs <p® to the leading term of 
the two point function (|3.1|) using the explicit expression ([334]), we can safely use (^3) for 



{ip® tp°k)o-> the degeneracy cP k as well as the number No given by equation (5.4). At the end 
we get the following leading behavior of this contribution 



log((<0 ip))o ~ h log N + 2ft log — + h log — ss h log 

ip h 



"0 



We need to worry about possible corrections to this leading behavior due to the other YDs. 
For this we need to remember that the degeneracies c4 are maximized by c$, and that the 
leading term of the maximum of (ip^ (pk)o is the value used for (ip® tp®)o- The final piece 
of information that we need is that the number of all possible YDs ipk is bounded from 
above by: 

Jf = C%2 h ps 2 h Nq , 



see subsection 4^ for more details. Using all these information, it is easy to conclude that 
the correction to the log of the full two point function ((tp ip))o when we include all the 
other YDs <pk is subleading. Hence, the log of the two point function for a generic probe 
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with h <C N in the background of a typical state O of the superstar ensemble has the 
following leading term: 

2" 



h N 

log ((■)/> V ; ))o ~ ^ log AT + 2h log — + h log — « /i log 

ipo h 



(5.5) 



This expression should be understood as follows. We should only take into account the 
leading behavior of the terms inside the log, whereas for h outside the log, we keep only its 
leading term. Notice the huge difference between this leading term and the corresponding 
one in the case of the backgrounds discussed in subsection [l^, given by equation (|3.8|) . So 
this class of probes ip probe deep into the geometry, which allows us to distinguish between 
typical states and a random one. Notice that this distinction survives even in the case of 
near typical states, see the conclusions for more details. 



5.1.2 The h ~ N and h > N cases 

In these cases there are few dominant YD ip k that are the result of adding 5. 
boxes to each row of O. Their degeneracy is such that its leading term is: 

N 

log dt ~ log dim/v ip ~ h log — . 



(h/N) 



(5.6) 



See section |4.3| and appendix-^ section F\2 for more details. The reason we can use expres- 
sion (|5.3|) for {tp* k <p* k )o is much easier to understand in the present situation. Due to the 
way we build the dominant YDs tpt, one easily concludes that the number of boxes added 
to the m rows whose length Oi <C N is less than: 



h l 

— to log m 

N 



in 



— h logiV < h logiV , 



which implies that the leading order of fk)o is the same as in the equation (5.3). 

Now using the fact that dim^v ip is an upper bound on the total number of YDs (pk, the 
leading term of the degeneracy dt of the dominant YDs (p k , as well as the leading behavior 
of (ipf, (Pf.)o, w e find the following leading term of the two point function (EO): 



log ((tp ip))o ~ h log N + 2 h log 



JV 



h log 



N 



N 



(5.7) 



Once again, we only care about scaling with N for what is inside the log, and for h outside 
the log we only keep its leading term in this expression. Using the same ideas as in the 
previous case, we can show that the inclusion of the other YDs ipk adds only subleading 
corrections to this formula. It is also the case here that this leading term is different than 
the corresponding one discussed in subsection 3.2 and given by equation (3.8). Combining 



this observation with a similar on in the h <C N regime, we conclude that the generic 
probes are efficient in distinguishing typical states from other backgrounds as well as other 
states in the superstar ensemble. For more details see the conclusions. 

Another remark worth mentioning has to do with the analytic properties of the leading 
behavior of the log of the two point function of the generic probes. Comparing the expres- 
sions ([5.5D and (pV^), it is easy to see that the leading term of log ((tp ip))o is continuous 
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for all values of h, but not differentiable at h ~ N. The continuity of the leading term is a 
result of the continuity of the two point function since subleading terms remain subleading 
when we change h. This continuity gives a further support of our conjecture that the 
leading term of the degeneracies dt is given by the leading term of the associated Kostka 
number. See section |4.3| as well as appendix-^ section F.2 for more details. On the other 



hand, the non-differentiability of log((^ ip))o happens at the point h ~ N. This is a point 
since the only difference between the expressions (5.5) and ( |5.7| ) resides in the term inside 



the log, and for such terms we only keep their scaling with N and not the full leading term. 
So we see a sign of a phase transition when the number of boxes h is of order N. This can 
be attributed to the fact that the gravity dual of the YD ip with h <C N is best thought 
of as gravitons, however when h becomes comparable to N, the duals are either giant or 
dual-giant gravitons [|||, 35]. 

5.2 The linear probes 

The discussion of this case is easier and this has to do with the fact that log dim^r ip ~ h in 



this class of probes. Let us first look for tpk that maximizes the expression (5.2). It is easy 

(k) 

to see that this expressions reaches its maximum when the quantities o| are maximized 
and the range of i is restricted to lie in the upper rows. A the solution to these two 
requirements is Si = ipi. The associated YD always exists. There are other YDs that 
are a slight modification of this one, but they do not always exist and furthermore they 
do not change the conclusions to be reached here. Hence, we will just proceed with the 
YD (fQ. We start from the expression Q5.2| ). To evaluate the sum, we need to distinguish 
between the two cases d ~ N and n ~ N . The first case is the easiest one. In this case we 
have n <C N, hence we can neglect the correction coming from the presence of i in the log 
terms in the expression ( |5.2| ). Next, we use the fact that Oi ~ ipi ~ N to find the following 
leading term: 

log (990 Po)o~h log A . 
What about the second case n ~ N . In this case we have ipi <C N. Taking advantage once 



again of the fact that most of the contribution to the sum in (5.2) will come from terms 
with d ~ N and (JV - i) ~ N, we get: 

log(v? Vo)o ~ h logiV , 

once again. Notice that here we used that Si <C N to expand the first log term. So, all in 
all, the largest possible value of (iff. <Pk)o i n the present case has the following leading log 
term: 

log (ip <fo)o ~ h logiV . (5.8) 

After we found the maximum of (tp^ <fk)oi we need now to get the leading term of 
the expression ( |3.4|) . This is easily done taking advantage of the fact that J\f the total 
number of YDs ipk is much smaller than (ipo (po)o- The idea is to look for a maximum and 
a minimum of the sum in (|3.4j ). A more than enough bounds are: 



(vo tpo)o < {{tp ip))o < AA x (ip Q ip ) o 
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Now using that: 

log TV < log diniAr Y> ~ h < (<p (fo)o , 

we find: 

Iog((VV»0 -/ilogiV. (5.9) 

Two observations are worth mentioning at this point. First of all, the linear probes are very 
bad in probing the states of the superstar ensemble if we restrict to leading order. This 
turns out to be case for the long probes as well as we will see below. This is because we get 



the same leading behavior as in the case of the background Q discussed in subsection 3.2 
see equation ( |3.9| ), even though the latter is not part of the superstar ensemble. This is 
because the linear probes probe the outer region of the geometry which is pretty much fixed 
by its asymptotic AdS§ x S 5 geometry. The second observation is that the expressions (|5.5|), 
Q5.70 , and the expression (|5.9| ) can be continuously connected to each other. However, due 
to non-differentiability at the "meeting" point -00 ~ N, it seems that each class of probes 
have a different gravity duals. It could also be that part of this non-differentiability is 
associated to the fact that the probes are probing different regions of the background 
geometry. 

5.3 The long probes 

For these probes, we will use the same idea as in the linear probes case. This is because the 
total number of the YDs ipk is "very small" . Actually, we can even get the first subleading 
term in this case. We start by maximizing ^k)o- Once again, the YD ipo for which 
df 1 ^ = ifii will do the job. Next, we need to evaluate the sum in expression (5.2). To do so, 



remember that in this class of probes ipi ~ d S> N and n -C N. It is straightforward to 
find the following leading term: 

log (<fo fo)o ~ h log d - (7 + 1) h , (5.10) 

where 7 is an ^-independent constant that depends on the actual shape of the probe ip 
and is given by: 

1 n d 



To get the leading term of the two point function ( |3.4| ) , we use precisely the same trick 
as in the linear probes case. Remember that in this class of probes the total number of 
YDs ipk satisfies the inequalities: 

log M < log dimAr i> ttnN \og(d/N) <C h <Clog (v? fo)o ■ 

Using these in equalities, we find the following leading term: 

log((V?V)>0~^logd-(7 + l)/i J (5.12) 



where 7 is given by equation ( |5.11 ) above. This leading order is the same as the corre- 



sponding one for the background Q given by equation ( 3.1C| ) in subsection 3.2. This once 
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again confirms our intuition that long and linear probes are very bad in distinguishing the 
details of the background geometry as they probe regions outside the core, and hence have 
the same response to all backgrounds with the same asymptotic. 

Looking at the expression ( |5.9| ) for the linear probes in the case d ~ N and the leading 
term of the expression ( 5.1 2| ) above, it is easy to see that we can unify them in a single 



simple expression that reads: 

log((V V»o ahlogd, (5.13) 

which has no explicit dependence on N. This suggests that for these type of probes it 
might be better to see them as fields in an U(d) theory rather than an U(iV) theory. We 
will come to this point in the conclusions. 

Notice that the leading terms of the log of the two point function ((tp ip))o fo r the long 
probes and the linear probes for d ~ N can be connected continuously. Given that we can 
get the first subleading term in the case of log probes, one is tempted to use continuity to 
derive the corresponding first subleading term in the case of linear probes. Unfortunately, 
we cannot do so as some terms that are further subleading for the long probes, become 
comparable to h when we take the limit d ~ N '. 

6. Discussion and conclusions 

The fuzzball proposal for black holes [||, ||, [?| has a lot of potential to solve the black hole 
puzzles. Even though there are several tests of this proposal (see the reviews jj], || [| 10 



11, |T^, [D| and the references therein.), it is not fully clear yet whether it is right or wrong, 
or needs just a reformulation. In this paper we started a further test of this proposal in 
the case of the superstar of [^0[ and its conjectured ensemble the superstar ensemble that 
was introduced in |l]]. We probed the different typical states of this ensemble using light 
half-BPS probes. Of course, to complete the test one needs to do the dual calculation in 
the superstar geometry. We hope to come back to this in the future. 

Although what we did in this paper was half the needed job, the results obtained 
strongly suggest that the conjecture of Jl) might survive. We find the following universal 



leading term of the two point function (3.1): 



log((^»0~a/ilogiV, (6.1) 

where h is the energy /conformal weight of the probe tp, N the flux of the background we 
are probing, and a is a constant that depends heavily on the shape of the probe tp and 
on a lesser level on the scaling of h with N, see equations (fj), (0)> (H)> and Q )- 



By the shape of the YD tp, we mean which of the two numbers: the number of columns 
d or the number of rows n dominates in the large iV limit. The dependence of a on the 
scaling of h with N in the generic probe class can be understood on the gravity side as a 
phase transition from gravitons in the case h <C N to (dual-)giant gravitons when we hit 



the h ~ N point g 



One can also understand intuitively the dependence of the two point function ((tp tp))o 
on N h as follows. One can think about each box as a free graviton in our background. The 
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only thing that the graviton will see is the mass iV 2 of the background. So one expects that 
the two point function of a single free graviton to be proportional to iV 2 . So, once we are 
dealing with h free gravitons, we should get something proportional to N 2h . However, this 
result should be modified since the more gravitons we have, the less free they become. This 
interaction between the gravitons should modify the two point function. Intuitively, one 
can think about turning on the interaction as effectively reducing the number of gravitons. 
Hence one expects the two point function to take the form N ah . What about the value of 
a? can it be understood from the dual bulk perspective? 

Intuitively, one can relate the dependence of a on the shape of the probe ip to the 
effects of angular momentum. Remember that the presence of angular momentum gives 
rise to centrifugal forces that forces the particles to move away from the center. In the 
gravitons regime (h <C N) there is a competition between the gravitational attraction 
that is proportional to the energy of the gravitons h and the centrifugal force that is set 
by the angular momentum. The latter depends on the shape of the probe. So it is not 
that strange that a will depend on the ratio of these two quantities. Things are much 
cleaner when we move into the (dual-)giant gravitons regime (h ~ N) or the regime of a 
bound state of gravitons and (dual-)giant gravitons (h ^> N). Take for example the case 
d S> n. In this case the dual probe is a bound state of n dual-giants in the AdSs part of 
the geometry with a very large angular momentum which is proportional to d. Since the 
radius of the dual-giant is proportional to its angular momentum [35], these dual- giants 
probe different regions of the background geometry depending on the value of their angular 
momentum d. One can use this fact to argue for the independence of the value of a from 
h in this regime. In the opposite case n 3> d, one talks about the giant gravitons inside 
the S 5 part of the geometry. In this case we have a bound state of d giants with angular 
momentum given by n. The dependence of a on n (and its independence from h) can be 
explained using the same arguments as in the dual-giant case. This is because the radius 
of the giant is also proportional to its angular momentum Notice that the invariance 
of our result, equations (|5.5|), (|5.7|), fl5.9|), under the exchange (rows f-> columns) can be 
seen as a manifestation of the symmetry (particle f-> hole) in the quantum hall description 
of the half-BPS sector of type-IIB string theory on asymptotically AdSsX S 5 ]29|, So 
everything seems to fall in place. 

We cheated a little bit when we declared that a did not depend on the background. 
Naively one would expect it to depend on the shape of the background. This is the case 
and this dependence is encoded in the tensor product decomposition of O ® tp, see section 
H] and appendix-^]. However, since we are dealing with typical states of an ensemble which 
we claim that it has a well defined effective gravity description Q, the leading order of a 
should have a definite value irrespective of the typical state we pick from the ensemble. 
This is precisely what we find. Actually we find more. If we try to move a little bit out of 
typicality, the value of a changes. This can be observed in the discussion of appendix-^. 
There when we discussed the possible modifications to a when we change the background 
YD O, we concluded that as far as the number of rows, after collapsing each set of equal 
length rows to just one row, remains of order N, a remains intact. This was only satisfied 
by typical states. We can actually construct YDs O that are very close to typicality but 
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with a different value of a. These are YDs O where the rows are grouped into N l ~ a sets 
of rows of equal length where each set has N a rows, or YDs that are close enough to these. 
It is clear from the discussion in appendix-^ that even for very small a <C 1, we still get a 
different value of a. The deviation is proportional to a, and hence small for small a. So, 
we can still talk about near typical states, but that does not rule out the fact that we still 
get a different value for a for these states. 

In relation to the discussion above, we have seen that long and linear probes are not 
very good in probing the background geometry when restricting to leading order. This is 
not that surprising since their duals, as argued before, are (dual-)giant gravitons that are 
probing the outer region of the background geometry since they have a very large angular 
momentum. Since this outer region is pretty much fixed by the asymptotic AdSs x S 5 and 
the background flux, we expect that almost all the states in the superstar ensemble to lead 
to the same two point function for these probes. Actually one expects more, one expects 
that this leading term should be the same for all backgrounds with the same asymptotics 
as the asymptotics of the superstar ensemble states. This is backed up by the results 
obtained for the states Q introduced in subsection ^2. So for these type of probes we need 
to include the subleading terms to see the difference. As one expects, we need to go to 
higher subleading terms for the long probes, whereas for the linear probes it seems that 
the first subleading term will be enough to see some differences emerging. However, to 
distinguish between different typical states, it seems that we need to go even higher in our 
"expansion" . 

One observation that was made at the end of section 5.3 and deserves a further dis- 
cussion has to do with YD tp whose number of columns d^> N. It was observed that the 
leading term of the log of two point function ((ip ip))o given by equation G5.12|) did not 
depend on N explicitly. Taking into account a similar formula for d ~ N, equation (5.9), it 
was mentioned that these long probes will have the same leading order if we were dealing 
with backgrounds with flux d. Which suggests that one can pretty much conjecture that 
these probes have a non-trivial backreaction on the background. This could be the case 
when interpreting these probes as d giants in S 5 . However, the picture is more obscure 
on the dual-giants side. It could be very well the case that since the dual-giant are close 
to the boundary, one should be careful about interpreting calculations on the dual field 
theory, see for example [36]. On the other hand, if we would have naively used very heavy 
probes h S> A 2 , we will end up always with these kind of YDs, since the number of rows 
is bounded by N, we will safely conclude that one should be dealing with a different dual 
field theory since these probes are heavier that the background and their backreaction 
should be taken into account. This gives a nice explanation of the "absence" of states of 
conformal weight h S> A 2 in N = 4 SU(iV) super Yang-Mills theory, even though there 
are no good reason for that. It is just that these states belong to a different theory when 
using AdS/CFT duality. 

Finally, there is a very important property that we did not manage to find a trace of it 
in this paper. This has to do with the fact that the effective dual geometry to the superstar 
ensemble is singular. In our analysis, the leading term of the two point function ( |3.lD was 
continuous across all the regimes of parameters of the probe ij). Although it was not 
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differentiable in all points of this parameter space, this non-differentiability was attributed 
to phase transition from gravitons to giant gravitons as well as to the fact that the dual 
gravity probes were probing different regions of the bulk geometry. One questionable point 
in this parameter space is when the number of columns d moves from being of order N 
or less to the case where d S> N. This is because the former probe inside the background 
geometry and the latter probe the outside regions. So, one is tempted to associate the non 
differentiability at the point d ~ N to this singularity. However, as argued above the long 
probes (i> N are kind of weird and one needs to be very careful when interpreting results 
that one obtains when using them as probes. Modulo this point, it seems that if there is a 
signature of this singularity, we need to go beyond the leading term or study higher point 
functions. We hope that future studies will help in elucidating this point. 

There are several things to be done to complete the discussion in this paper. The first 
one, which was mentioned already, is to do the dual calculation in the superstar background. 
Another direction of research has to do with the simple final result ( |6.1j ) that we got. Its 
simplicity begs for a better approach to derive it instead of our "tour-de-force" approach. 
Maybe a better use of the large N technology could help, especially the free fermion or 
matrix models language. A third open question has to do with beyond the leading term 
calculation. Unfortunately, the techniques developed here are limited to the leading order, 
and things get wild when going beyond except for long probes where we could calculate the 
first subleading term ( 5.12| ). Given that the long probes are the worst to use, this does not 



seem to be such an interesting progress. Maybe a different approach to the problem can 
help here. Actually, one of the original motivations to this work was to try to distinguish 
the typical states of the superstar ensemble. We found, as expected, that we need to go 
beyond the leading term for that. The question is then, at which subleading term can we 
start to distinguish some typical states from the others. According to the calculations in 
this paper, it seems that we might have some chance in the first subleading correction in 
the case of generic probes. However our technology that we used here comes short to this 
task. In the same line of thoughts, one wonders if there is a nice expansion parameter 
for log ((if) ip))o^ It seems that the general structure will be an alternation of terms that 
contain log N, and terms without this log. It does seem also that for generic probes, (1/tpo) 
might be the parameter of expansion, however we could not come up with a nice reason 
to why this is the case. It is possible that this parameter will work only for the first few 
subleading terms since it reflects the separation of the (dual-)giants in the background 
geometry, and one would expect that at certain stage the flux of the background will kick 
in. This happens for the second subleading term in the case of the long probes as the 
parameter of expansion seems to be (N/d). Actually, it is not even clear that there will 
be a nice expansion in the first place. Hopefully, further works tackling the same problem 
discussed in this paper using better technology will help shed light on these open questions 
and others. 
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A. Conventions and notation 

In this appendix we will summarize our conventions and notation that is used throughout 
this paper. 

Young diagrams (YD) Throughout this paper we will use YD as a short notation for 
a Young diagram and YDs for more than one. A YD is a collection of boxes, arranged 
in left-justified rows, with the row lengths weakly decreasing from top to bottom. For 
example: 



is a YD. We start enumerating rows from top to bottom and columns from left to right. Our 
main use of YDs in this paper is through their connection with irreducible representations 
of the unitary and permutation groups. 

The shape of a YD is the collection of the lengths of the rows of the YD into an n-tuple 
of integers. For example the shape of the previous YD is rp = (4 , 3 , 2 , 2). We will be 
using the same symbol to denote the YD as well as its shape. 

The hook lengths of a YD A box in a YD is denoted by the pair (i , j) where i is the 
number of the row and j is the number of the column whose intersection is this box. hu j) 
stands for the hook length associated to the box (i , j). The hook length associated to a 
box is the number of boxes below and to the left of it plus one. As an example, we depict 
below a YT where its boxes are filled with their associated hook lengths hu^y. 
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The product of all the hook lengths plays an important role in the formulation of the 
dimension of the SU(iV) group or the permutation group representation given by this YD, 
see equations (|2.12| ) and ( p. 13 ). 

Young tableau (YT) Throughout this paper we will use YT as a short notation for a 
Young tableau and YTx for more than one. A YT is a YD where each box is filled with 
a number 13 . The collection of these number is called a filling, and will be denoted by a 

13 This is not the only possibility as we can use letters for example to label the boxes. However we will 
be mainly interested in labels that are strictly positive numbers 
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Greek letter e.g. f3. We will use the word labeling of the YD to refer to all possible fillings 
j3. It is more convenient to denote a filling j3 by f3 = {j3\ , j$2 , ■ ■ • An), where denotes 
the number of times the integer i appears. We denote by |/3| the length of the filling (5 
which is the number of its non-zero entries /3j 7^ 0. As an example, the following YTx: 
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correspond to the YD of the previous example with the filling (3 = (2, 3, 0,3, 2, 0,1). 
The length of this filling is |/3| = 5. 

Standard Young tableau (SYT) Throughout this paper we will use SYT as a short 
notation for a standard Young tableau, and SYTx for more than one. Let h be the number 
of boxes of a YD under consideration. A STY is a YD whose boxes are filled with numbers 
1 , 2 , . . . h, each one occurring once and such that the numbers on the same row are strictly 
increasing from left to right, and the numbers on the same column are strictly increasing 
from top to bottom. The following YTx depict some examples of SYT. 
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The number of all SYTx associated to a YD tp gives the dimension of the representation 



of the permutation group given by this YD ip, see for example [27] 



Semi-standard Young tableau (SSYT) Throughout this paper we will use SSYT as 
a short notation for a semi-standard Young tableau and SSYTx for more than one. A 
SSYT is a YD whose boxes are filled with positive integers such that the numbers on the 
same row are weakly increasing from left to right and the numbers on the same column are 
strictly increasing from top to bottom. The following YTx are examples of SSYT. 
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The number of all possible SSYTx associated to a YD ip, whose labels are integers running 
from 1 to A is the dimension of the representation of the group SU(A) given by this YD 
ip, see for example [p7f| . 

Kostka numbers The Kostka number t p is the number of SSYTx associated to the 
YD of shape tp and the filling f3. 
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Background YD conventions In this paper we will use the letter O to denote the 
background under consideration, be it a state, a geometry, or a dual YD. We will denote 
by Oi the length of the row i of the YD O. We will reserve the notation Oq to the limit 
shape YD of the ensemble under consideration. The energy/conformal dimension/number 
of boxes of the background will be denoted by A, whereas the number of its columns will 



be denoted by D. For characteristics of these backgrounds see section 2.1 



Probe YD conventions In this paper we will use the letter ip to denote the probe, 
be it a state, or a dual YD. Its energy/conformal dimension/number of boxes will be 
denoted by h. ipi will denote the length of the row i. We will also reserve the letter d to 
denote the number of its columns (d = ip\), the letter n to denote the number of its rows, 
ipo = Max{n, d}, and the letter H to denote the product over all its hook lengths. See 
section 12.21 for characteristics of the probe YDs that will be used in this paper. 



The dimension of a YD Two dimensions associated to the probe YD ip will play an 
important role in this paper, dmi/v ip will denote the dimension of ip seen as an irreducible 
representation of the group SU(iV), whereas dim/j if) stands for its dimension seen as an 
irreducible representation of the permutation group Sh- 

Leading behavior vs leading term Two notions that will be used extensively in the 
paper: leading behavior and leading term. By the leading behavior of a quantity A we 
mean its scaling behavior with N. We use the symbol "~" to express the leading behavior. 
On the other hand, the leading term of a quantity A means the dominant term in an 
expansion of A in the large iV limit. We will use the symbol "~" to express the leading 
term. For example: 

- 1 1 

A = ^2k 2 = - N (N + 1)(2N + 1) , A~N 3 , A « - N 3 . 

k=i 6 3 



B. A collection of some useful formulas 



Most of the results derived in this paper rely on formulas that will be discussed in this 
appendix. The first kind of formulas has to do with approximating sums of the form: 

n 

Si(h , a ; n) = ^^(h k + of \og(h k + a) , 

k=0 

for n 1, and £ < 2, which will be the topic of the first section below. The second formula 
has to do with counting the possible ways to partition a positive integer n into m positive 
integers nf, i = 1 , 2 , . . . m, where the order of the index i is important. We will be more 
precise about what are we really counting in the second section below. In the last section 
we will discuss some useful properties of polylogarithm function hi n (x) that will be used 
in appendix-^. 
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B.l Sum approximation 

In the following we will discuss an approximation to sums of the form: 



Si(h , a; n) = y~^(/t k + a) e log(/i k + a) , 



(B.l) 



k=0 



for n> 1. There are different approaches to deal with this kind of sums. We will use the 
Euler-Maclaurin approximation which is valid for all smooth real functions f(x). We have 
(see j37j] for example): 



J2f(hk+a) 



k=0 



f(x) dx-\ 



M±M +± |^ B 2 k (/<' (b) - ^ (a)) + K h 

(B.2) 



where b = a + hn, and IZh is the remainder and is given by: 



h 2p+2 



n-1 



Uh = wrw. B2p+2 £> /(2p+2) (a + h k + e) ' 



(B.3) 



where 9 is some constant between and 1 i.e. < 6 < 1. It can also be shown that 7Z is 
bounded by: 



\TZ\ <~ 



2Q{2p) 



\f^\x)\dx 



(B.4) 



(2tt) 2 p 

where £(t) is the Riemann zeta function. The constants B n that appear in the relation 
( B.2D are the Bernoulli numbers with B\ = 1/2. Their generating function is given by: 



m=0 



m\ 1 — e 



:= B(t) . 



(B.5) 



It is clear from the generating function above that if n > 1 then Bm+\ = 0. The easiest 
way to see this is by using the observation: 

B{t)-B(-t) = t. 



To apply the Euler-Maclaurin formula to our sum (f^l), we need to calculate the 
integral as well as the derivatives of the function f(x) = x f ~ logx. We have: 



1 



J+l 



f(x) dx 

f( m )( x )=e\ iogx + ... 



log X 



+ 1 



X 



t-i 



log x + ^2 



+ 1-J 



for i < i 



The last ingredient we need is the values of the first few Bernoulli numbers. We have: 
Bo = 1 , Bi 



~ 1 Bo = — 1 Ba 
2 ' 6 



1 

30 
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Putting everything together we get the following approximations to the first few sums: 



S {h, a; n) ~ - 
S\{h, a; n) ~ — 



S 2 (h, a; n) ~ — 



6 + ^ log 6 - - ~ J log a - /? 

h 2 \ ( h 2 

b(b + h) + — log 6 - a (a - /i) + — ) log a 



(B.6) 



6 



6 



-n(b + a) 



h 



h 



b(b + h) \ b + - j log b - a (a - h) ( a - - ) log a 

1 /i 2 
-- n (a 2 + 6 2 + a6) + -n, 



(B.7) 



(B.8) 



where we truncated the sums to the log order which is enough for our purposes. 

In the remaining of this subsection, we will use the approximations above to evaluate 
a double sum that will appear frequently in appendix-^. This double sum is given by: 



ra-l d-l 

log Ho = Y Y + 1 + * + ft ' 

i=0 j=0 



(B.9) 



where N is some positive integer. The idea is to use the special structure of the summand 
to change the double sum over i and j to a single one over k = i + j. For that we need to 
introduce the following quantities: 

a = Min {n , d} , b = Max {n , d} . 

Notice that a + b = n + d. It is easy to see that one gets: 

a—1 b a+b—1 

log H = Y k lo E(N + k ) + a Y + k )+ Y ( a + b ~ k "> log ^ + k "> 

k=a k=b+l 



k=l 



= H N {n + d) - H N {n) - H N {d) 
where Hp (a) is a short notation for the sum: 



(B.10) 



a-l 



Hpia) = Y(a~k) log(/3 + k) 



(B.ll) 



k=l 



It is a straightforward exercise to get an approximate expression for Hp{a) using equations 
( |R6|) and We get: 

a-2 a-2 

H p {a) = {a + p)Y M/3 + 1 + *) ~ £(0 + 1 + *0 log(/3 + 1 + fc) 

fc=0 fc=0 



Iog(a + p) 



1 



/3 2 + a(2/3 + l) 



log/3--a(2/3 + 3a) . 

(B.12) 
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Plugging in this result into the expression ( B.lCj ) of log "Ho above, we find: 
log Ho ~ ~ \ Ar + n + d-~\ log(AA + n + d) + ~ \ M - ~J log AA- 

-~ (V + n-£) log(AA + n)-i (V + d-i) log(AA + d) - | nd . (B.13) 
B.2 A counting formula 

We will need at different places in this paper to count the possible ways to partition a 
positive integer n into m positive integers nf, i = 1 , 2 , . . . m. More precisely we are 
interested in counting the m-tuples (ni , n% , ... , n m ) such that: 



Mi; rii>0, y~] 



in 

m = n . 

1=1 



Notice that this is different, and much easier, than the usual notion of a partition of 
integers. In the latter case the order of the index i is not important i.e. we are counting 
sets of integers instead of m-tuples. On top of that in the usual counting of partitions, we 
require the integers rtj to be strictly positive. 

The answer to our counting question is easy. It is the same as counting the number of 
ways to distribute n balls on m boxes. The answer to the latter question is given by: 

V m (n) = C n n + m _ x , (B.14) 

where C\ is the binomial coefficient: 

at "'■ 



b a\ {b -a)!' 
B.3 The polylogarithm function 

In this section we will summarize some of the important properties of the polylogarithm 
functions Li n (z), that will be of importance to us in appendix-|C|. Since, in appendix-]^, we 
will be dealing with quantities that are of the form e~^ x ~ Xy , where /?, x, A and y are all 
positive, we restrict the argument z of the function Li n (z) to lay in the interval < z < 1. 
We have by definition: 

OO 

U n (z) = J2^; Vn>0. (B.15) 

k=l 

It is easy to see, using the definition above that: 

f Lln ^ dy = Li ra+ i(z) ; for 0<z<l, (B.16) 

Jo y 

zd z Li n (z) = Li„_i(» . (B.17) 
One can use the second property to define polylog for negative n. We find: 

Li ( _ n) (z) = (zd z ) n Vn>0, (B.18) 
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where the case n = corresponds to the function that the operator(,z d z ) n acts on. 

The last property we would like to discuss is the Taylor expansion of the polylogarithm 
function whose argument takes the special form z = z$e a , where a <C 1. It can be easily 
proved, using the defining equation of the polylogarithm ( |B.15| ) that: 

n u 

Li n (zq e a ) - Li n (z ) ~ ^ — Li n _ fc (z ) + . . . . (B.19) 

k=l 

This expansion is more than enough for our purposes even though we truncated the sum 
to n. 



C. Thermodynamics of the superstar ensemble 

In this appendix, we will evaluate some thermodynamical quantities of our superstar en- 
semble of background YDs, see section 2.1. Our first step is to evaluate the log of the 
partition function Z defined in equation (|2.3j), using the approximation ( |B.2j ). We will 
check the validity of our approximation at the end. After that, we will move on to fix the 
values of the parameters f3 and A. All the expressions derived below can be found in Q 
with appropriate modifications. 

Our starting point is the defining equation of Z, equation (|2.3j ), that we rewrite here 
for convenience. 

N N 
3=1 3=1 

where Li n (x) stands for the polylog function defined in ( B.15| ). To evaluate the sum that 
appears in the expression of logi? above using the approximation ( |B.2| ), we need to take 
h = 1, a = 1, b = N, and f{x) = \j\\{pq x ). We have: 

/ Lh (p<f) dx=- / dy = - [Li 2 (pq N ) - U 2 (pq)] , (C.2) 

h logg J pq y logq 

where we used the change of variables y = pq x , then the property ( |B.16| ) of the polylog 
functions since both q and p are positive and less than one, and N is a positive integer. 
Next, we need to evaluate the derivatives of f{x). One can prove using the property ( B.17| ) 
and the definition (B.18) that: 

/ (n) (*) = (loggrLi (1 _ n) (pq x ) • (C.3) 

The proof proceeds by using once again the variable y = pq x , then the chain rule to change 
the derivative with respect to x to: 

d x = {logq) (yd y ) . 

Collecting everything, we find the following approximate expression for log Z: 
logZc^^- [U 2 (pq N )-U 2 (pq)]+~ [Lh (pq N ) +U l (pq)} 

oo 

+ Ew( lo ^) (2/C " 1) [^2- 2fe )(p^)-Li (2 _ 2fc) (p<z)] , (C.4) 
k=i \ >' 
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Although we managed to evaluate log Z, we need to check the validity of our approxima- 
tion. Luckily enough, our /3 = — logq is very small ]l| section 3.3]. In this regime we can 
safely neglect the sum over k in equation (C.4), to get the more manageable approximation: 

logZ^J- [U 2 (pq N )-U 2 (pq)]+^ [Li x (p q N ) + Li x (p g)] , (C.5) 



To evaluate A, we plug this expression in the relation fl2.4|) to get: 

(C.6) 

where we used the property ( p. 17] ). To make connection with a similar expression that 
appears in Q, we need to use the fact that Lii(x) = — log(l — x) and the identity: 

7T 2 

Li 2 (y) + Li 2 (l - y) = — - logy log(l - y) . 

b 

The expression ( |C.5| ) above together with the relation (2.5), gives for D the following 
expression 14 : 

D -^~ [log(l-p)-log(l-pg")] . (C.7) 
Observe that we can arrive at the same expressions as ( |C.6| ) and ( p.7|) by starting from 



( |2.4| ) and (2.5), then performing the sum following the same steps as in evaluating log Z. 
The value of p can be already fixed using the constraint ( |C.7] ). We find: 

l-q D 

p = T^^- (C ' 8) 



We need also a way to fix /?, however the expression ( |C.6| ) even after substituting p 
by its value is complicated and the solution depends on the regime of (3. We will take a 
slight detour here to have an idea about the regime of /?, then we will come back to its 
value later on. The idea is to use the constraint on the limit shape YD of this ensemble 
Q. Remember that we want the limit shape curve to be a line. This because the superstar 
maximized the entropy. The latter is associated to moving around the outer boxes of the 
YDs. So, naively one would expect that the typical YDs of these ensemble should be closer 
to a triangular YD |fflj. 

Following [PJ, we start by 15 : 



N 

y{x) = ^{cii 



where x is the coordinate along the rows that increases from the top to the bottom of YDs, 
and y(x) is the length of the row x. Using the expression for (q) that appears in ( p.5[ ), we 
find the following equation for the limit shape: 

(l-pq N ) q y +pq x = 1 . 

14 It turns out that keeping only the leading term is enough for our purposes. 

15 We have a different expression for y(x) than the one in |l| since we have different conventions. 
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One can easily check that the box (N, 0) is part of this limit shape curve. Requiring on 
the other hand that (0, D) is also part of this limit shape curve leads to the same value 
of p as before, equation ( |C.8[ ). Plugging this value into the expression above leads to the 
following limit shape curve equation: 

(1 - q N ) q y +{l~ q D ) <f = 1 - q D+N ■ (C.9) 

To get a straight line out of this equation, we need to bring down x and y from the exponent. 
Since both x and y are of order N, we need (ft N) <C 1 i.e. ft <C (1/N). If this is satisfied, 
one gets for p the following approximate value: 

p^ ° . (CIO) 
y D+M V ; 



The limit shape curve becomes in this case: 

' ~ N 

which is a straight line. 

To fix completely ft, one can try to use the fact that (ft N <C 1), expand the expression 
( |C.6| ) appropriately using equation ( B.19 ) and equate A to (N D)/2. However, one gets 



exactly the latter equality independently of ft. One could have anticipated this since the 
limit shape YD is a triangle and (A) should give to leading order the same number of boxes 
as in this YD, which is precisely (N D)/2, a value that is independent of ft. 
Our last card is to use the entropy S of this ensemble. We have: 

S = ftA + \D + logZ^-N log(l -p)-D \ogp + C(log 2 q) ~ N . (C.12) 

Notice that there is no linear term in ft = logq, so the superstar ensemble extremizes 
the entropy. It can be shown easily by calculating the coefficient in front of log 2 q, that 
the superstar ensemble maximizes the entropy [jij. Using the relation between the inverse 
temperature ft, the energy A and the entropy S, we find |l]]: 

» = < C13) 

where we used that A ~ iV 2 . Finally, we should remember that in the expressions above 
we set the Plank constant to one h = 1 jjj]. Once we restore it, we find N logq <S 1 even 
though the scaling of log q with iV is log q ~ 1/JV. 

A final observation is in order here. A careful treatment of this ensemble and its entropy 
leads to an agreement between the latter and the entropy of the superstar of • This led 
JH to conjecture that the superstar geometry should be seen as an effective description of 
this ensemble, hence the name. For more details and evidences see ]IJ. 

D. A non-trivial simple family of Young diagrams 

In the following we are going to evaluate the dimension of a family of YDs that we call from 
now on the optimum family, that is generic enough for our purposes and at the same time 
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simple enough to allow for exact evaluation of the leading term of the log of its dimension. 
In contrast to the general discussion about the dimension of different probe YDs in section 
|2.2| where equation ( |2,11| ) was mainly used, we will instead be using equation ( |2,12| ) in this 
appendix. We summarized the results at the end of this appendix for the convenience of 
the reader. This allows for an easier comparison with the general discussion in section |2.2| . 

Let us first start by introducing this family of YDs. The rows of a YD in this family 
are grouped into groups of no rows of equal length. Two consecutive groups will have 
a constant shift in length given by do- The number of sets of equal length rows will be 
m < [N/no\ and the last rows will be of length d$. The total number of boxes is: 

h = —nodom (m + 1) . (D.l) 

Let us denote by < k < (to — 1), the group number and by 1 < a < tiq the row number 
in each group, then the row length takes the form: 

ipkno+a = d (m - k) ; if 1 < k < m - 1 , 

and vanishes for k > m. To make connection with our conventions in appendix-|A| notice 
that the number of rows of a YD in this family is n = n$ m, and the number of columns 
is d = do m. Notice also that the YDs in this family are homogeneous as can be checked 
easily even though not all of the columns/rows scale in the same way with N. 

We will start our discussion by looking for an approximate expression of the log of 
the numerator of equation ( 2.12j ) for this family in the case where N, no, do, are very 
large. From now on we will call this numerator num^pj. Before that, let us first fix the 
parametrization of the position of a box in a YD of this family. Since the YD is divided 
into m blocks of no rows and m blocks of do columns, one will need four integers i, j, s, 
and r where < % < m— 1, < j < m— 1 — i, 1 < s < no, and 1 < r < do to parametrize a 
position of a box. The latter will be characterized by its row number ino + s, and its column 
number j do + r. We will denote such information by (i , j ; s , r). Before continuing, let 
us also introduce a set of short notations: 

to = no + do , ip =m rh = d + n , d = dom , n = no to , 

N = N-n, N = N + d = N + mm . (D.2) 

We will subdivide this family of YDs into two subfamilies corresponding to the case 
to ~ iV and the case m ~ , a > and we will discuss them independently. We will 
call the first subfamily opto and the second subfamily opt^. Before specializing to the two 
subfamilies, we will approximate log num^j and log H op t by carrying out all the sums 
involved except for the sum that depends on to. 

As declared above, we start by evaluating log num^. Using the parametrization of 
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the position of boxes in V'opt we get: 



im — s) 



m—l m—l—i no do 

log num V> op t =^2 X] ^2^2 1 °S( N + J d + r ' 

i=0 j=0 s=l r'=l 

no-1 (fc+l)do-l 

]T Yl \og(N + kn + t + u + l) 

t=0 u=0 



m—l 
fc=0 

where we introduced the new variables k = m — 1 — i, t = uq — s, r = r' — 1, and 
u = doj + r. Notice that the sum inside the two brackets is of the form of the sum ( |B.S| ) 
whose approximate expression is given by (B.13). Using this observation, we get: 



m—l 



log num ip opt 



3 1 
--h + - > 
9 9 ^ 



k=0 



(N + (k + l)rh) 



(iV + (fc + l)m - n ) 



1 



1 

+ 2 



( 


iV 2 -- 


logiV - 


N 2 -- 




6 




6 



6 

logiV 



log(JV + (k + 1) rh) 
log(N + (k + l)m- n ) 



(D.3) 



Next, we need to evaluate log H pt- Once gain our starting point is the parametrization 
of the position of boxes in a YD of this family. It is easy to see that the hook length 
associated to a box (i , j ; s , r) is given by: 

h{i,j- s ,r) = [m- (i+j)]m + l - (r + s) , 
where 1 < s < no, 1 < r < do, and i + j < m — 1. So we need to evaluate: 

m-lm-l-i no do 

log% op f=^ J^J^log([m-(t + i)]m + l-(r + s)) 

j=0 i=0 s=l r=l 
m—l no — 1 do — 1 

= (no — fc) log(/c rh + 1 + u + v) 

k=0 u=0 v=0 

m—l 



3 , 1 

9 71 +9 Z^™"^ 



2 2 

(fe m + no 

^ m—l 

- J^(m- k) 



k=0 

6 



(A; + l) 2 m 2 



log [(A; + 1) rh] 



k=l 



log(k rh + no 
1 



(km + do)^ 



log (A; m + do 



,2-2 

fe m 



6 



log (A; m) 



(D.4) 



where we introduced the new variables k = m — 1 — (i + j), u = uq — s, and v = do — r to 
move from the first to the second line. The evaluation of the sum over u and v follows the 
same steps as in evaluating Hq defined in equation ( |B.9| ) whose approximate expression is 
given by equation ( [B. 13|) . 

We have two options in evaluating the leftover sum over k in the equations ( p.3| ) and 
( |D.4| ). We can either look for an approximate expression of the sum (|B.1| ) for £ < 3, perform 



-45 - 



the sum over k, then expand the final result depending on the regime of the parameters 
no, do, and m. Or, we can first expand the summands in equations ( p.3| ) and ( p.4[ ), then 
perform the sum over k using, at worst, the approximations ( |B.6| ), ( |B.7| ) and ( [B.8D . We find 
it easier to follow the second option and that is what we are going to do in the following for 
each subfamily. Another reason to follow such route is that we want to make our opt family 
of YDs as large as possible. In doing so the number m can be finite and small and hence 
the techniques of section |B.1| of appendix-^ will not be applicable. Before going ahead with 
our task, Let us introduce some notations for the sake of unifying the discussion below. 
We will assume the following leading behavior of no, m and do: 

no~nN u , m^mN^, d ~dN s , 
n^nfhN^ +u , d^mdN^ +s , h^hN 2 ^ +u+s , (D.5) 

where n, m, d are iV-independent and h = (1/2) n dm (m + 1). Remember that we are 
interested in the regime of parameters: 

2^+u+S <2 , v + n<l. (D.6) 

D.l The opto Family 

This case corresponds to u = 0. In the present case our regime of parameters 
simplifies to: 

v + 5 <2 , v < 1 , 



and since N enters in the expression ( D.3 ), we distinguish the following cases: 
• 1 < 5 < 2: This implies that v < 1, and hence m ~ do. We get: 

2 h ( m \ d 

log num-i/w ~ 8 h log N — h -\ > k log(k d) + mn N 1+u log — , 

m(m+l) I -/V 

(D.7) 

lo g H opt ~5hlogN-h+ 2h - I Vfc log(fcd) I + Us-v)N 2v \ogN , 



m [m + 



log dimAr ipopt «(i5-l)mn N 1+u log N rj n N log . (D.9) 



5 = 1: Once again we have v < 1 and hence m ~ do- We find: 

2h 



log num if) op t ~ /i log N — h + 



fc=i 

J71 



(D.10) 



dm (m + 1) 
m (m + 1) 

m 

log dim N ^ « - - V [(1 + A; d) log(l + A; d) - (k d) \og(k d)} . (D.12 

am (m + 1 f— f 

' fc=i 



\ogU opt ~h\ogN -h+ 2H (^fclog(fcrf) ) , (D.ll) 



\k=l 

2h 
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5 < 1: We need to distinguish between four subcases: 

* v < 5: This is the simplest case as rh ~ d$ as before. We find: 
log num ip pt ~ ft log N + h , 

2ft ( m \ 

log U opt « <5ft logiV - ft + — — V k log(fcd) 

m (m + 1 \ ^-^ 



m (m + 
N 

log diniAr Tpopt « (1 - 5) h log TV « ft log — - , 



\fc=i 



(D.13) 
(D.14) 

(D.15) 



log dinift Vopt ~ f ft. log iV + log 



1 1 ft 7 

ift log — + a ft 
a 



■ n m (to + 1} 



m (m + 1) 



fc=i 



(D.16) 



where a is some constant. The reason we included the next subleading term in 
the last expression is that v = is a valid point in our space of parameters. It 
is easy to check that this subleading term is positive when v = as it should 
be. This is because in this case n > 1 and: 

m m 

k log k < k 1°S m = -m(m + 1) log m . 
fc=i fc=i 

* = (5: This case is slightly more complicated than the previous one. We find: 

(D.17) 
(D.18) 

(D.19) 



log num ip pt sa ft log N + ft , 
log Hopt^ShlogN + Oih) , 

N N 

log dimAr i>opt ~ (1 - 5) ft log TV ft log — « ft log — 

an 



ft ft 

log dim/i V'opi ~ <5 ft log iV « ft log - « ft log - 

an 

where we used that ^ = 5^0. 
* 1 > v > 5: In this case we have to ~ no- We find: 



(D.20) 



log num ^opf w ft log N + ft , 



2ft 



log % op t ~ f ft log iV - ft H ? — V] fc log(fc n) 

m (m + 1) \ 



\k=l 



N 

log dim/v Vw ~ (1 - v) ft log N « ft log — , 

n 



log dim/j Vw ~ (5 ft log iV + log 



L 1 71 

ft log — haft 
n 



— dm (to + 1) 



(D.21) 
(D.22) 

(D.23) 

2 ™ \ 

— 7 — > fc log fc ft 

m (to + 1) J / 

(D.24) 



where a is a constant. Notice that the roles of uq and do here are switched with 
respect to the case 5 > u, even though the approximate expression ( |D.3| ) is not 
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symmetric under the exchange no do- This is a manifestation the duality 



discussed in section- 2.2.1 



* 1 = v > 5: In this case m ~ uq ~ JV. We find: 



log num ^opt pa /i log N — h 



2h 



mm (m + 1) 



- fcn) log(l - fc 



n 



,fc=i 



2/i 



log H op t « /i log JV - /i H 7 — - V" fc log(/c 

m (m + 1) V £^ 



n) 



(D.25) 
(D.26) 



log dirnA? ip opt 



2h 



Rfflfffl + 1) ^— ' 
fe=l 



— [(1 — kn) log(l — n) + (A; n) log(& n)] . 

(D.27) 



Notice that log dim jvV'opt is wen defined and positive since mn < 1 
D.2 The opt M Family 

In this case, we have the following range of parameters: 

0</i<l, 0< i/ + //< 1, <2fi + u + 5 <2 . 

The origin of the complication in this case is that we need to worry about d = mdo and 
n = riQ m on top of no and do . As in the previous subfamily we will use the different ranges 
of 5 as the main classifying tool. We have the following cases: 

• 1 < 6 < 2: This is the easiest case. We find: 



1 2 \ . r h 



log num^opt pa [h + nN - -n log V> + - — ^ log m 
1 2 / bm z 

1 \ 3 h 

n ( N n ) log iV h H h . . . , 

1 2 & 2 2m 



(D.28) 



1 



1 



h 



log Hop* ~ ( h - - n n ] log ^ + ^3 log m - - n n log n - - /i + — + . . . , 



6 m 



log dimjv ipopt pa (n + 8 -l)nfh N 1+v+lx log N pa nN log — . 



N 



(D.29) 
(D.30) 
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• 5 = 1: Following the same steps as before we find: 

log num tpopt ps (h + + n ^ ~ \ n ^ lo S^ 

5 + w)(^)^ + * } 

_ n fjV-in") logiV-^ + ^- + ... , (D.31) 
\ 2 / 2 2m 

, „, /, 1 \ , 7 h 1 3 , li 

log Uopt ~\h--n nj log tp + — f log m - - ra n log n - - h + — + . . . , 

(D.32) 

log diniiv ipopt limn N 1+ ^ +u log N ps n iV log . (D.33) 

• (5 < 1: As before we distinguish three subcases: 

* u < 5: We distinguish three possibilities: 

* 5 + \x > 1 : In this case we need to keep terms that involve do in the log 
expression. We find: 

3 

log num tf; opt ps (h + n N) log i/j - n N log N - - h + . . . , (D.34) 
log "Hop* ~ (h-^n n \ log ^ - ^ h + ^ + . . . , (D.35) 

log diniiv W w (<5 + /i - 1) m n iV 1+ ^ log N ps n TV log . (D.36) 

* 5 + jU = 1: Following the same steps as in the previous case, we find: 

2 

log num if; opt PS - ^ n 2 ^j log TV + ^ - t.^ h log(l + ■0) 

f + 4) ft+ "" (M7) 

log Uopt ~ f/i-^nonj log ^ - ^ /i + . . . , (D.38) 
1 

where ip = md. 

* 5 + (i < 1: We find in this case: 

log num tpopt ps h log N + /t + ... , (D.40) 

log ~ ^ - ^ n r?j logtjj -^h + ... , (D-41) 

TV 

log diniTv ipopt ~ (1 — /x — <5) /i log TV ~ /i log — , (D.42) 

log dinift ^ op i ps (i/ + jj) h log AT ps h log ^ . (D.43) 



log dimjv ^opt -7=^ [(1 + VO 2 log(l + V) " (VO 2 log V " W[ h , (D.39) 
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* 5 = v. We find: 

log num i(; opt ps h log iV - - /i + . . . , (D.44) 

logHopt~hlog$ + ... , (D.45) 

N N 

log diniAr ip opt ps (1 - /i - 6) h log iV ps /i log — ps h log — , (D.46) 

an 

log dim/j ^ op t RJ (/i + 5) /i log iV ps ft log - ps ft, log — . (D.47) 

an 

•k 5 < v. We distinguish two possibilities: 

* fi + z/ = 1: In this case we need to keep terms involving no untouched when 
expanding the log terms. Then, we evaluate the sum over k. We find: 

/ 1 — m n\ 2 / 1 3\ 

log num?/w ~ h 1°S N + I — — I h log(l — inn) + I -^-^ ] h , 

\ mn J \mn 2 / 

(D.48) 

log?V ~h\ogi>-^h + ... , (D.49) 

log dimTvVw ~ [(1 ~~ mn) 2 log(l — mn) — (mn) 2 log(mn) + (mn)l 

(mny L J 

(D.50) 

Notice that dimjyi/v* * s positive as it should be. To see that remember 
that 16 (mn) < 1. 

* u + v < 1: Here we expand the log terms keeping only TV. We find, after 
summing over k: 

log num ipopt ~ h log JV + • • • , (D.51) 

log Hopt Ps /i log^ - - /i+ . . . , (D.52) 

N 

log dim^r ipopt ps (1 — v — //) h log N m h log — , (D.53) 

n 

log dim h ip opt ~ (" + S) h log iV ps h log - . (D.54) 

n 

D.3 Summary 

In the following we summarize what we got in the previous two subsections. This will make 
it much easier to compare with the general discussion in section 2.2. To make contact with 
the expressions there we introduce the quantity: 

ipo = Max {d , n} . 

We will also borrow the classification of probes from there. Looking at the different cases 
above, we find the following leading behavior of log dimjv ip and log dim/j ip: 



16 It is easy to prove that the function: f(x) = (1 — x) 2 log(l — x) — x log a; + x, is an increasing function 
in the interval (0, 1] which implies that < f(x) < 1. 



- 50 - 



• Generic Probes: In the case \i = 0, these probes correspond to 5 < 1 and include the 
cases 5 < u, 5 = v and S > v. In the case \i ^ 0, these probes correspond to 6 < 1 
once again and include the cases (v < 6 , fj, + 6 < 1), v = S and (8 < v , v + /i < 1). 
In all these cases, we find that: 

log dinijv W ~ fa log — , log dim/j if ^ h log - — hall , (D.55) 
Vo Wo 

where a is some constant whose precise value is not of interest to us. 

• Linear Probes: In the case /i = 0, these probes include 5 = 1 and v = 1 cases. In the 
case /i 0, we have the cases (y < 5 , + <5 = 1) and (<5 < , z/ + // = 1). In all this 
cases we find: 

log dinijv W ~ & fa , (D.56) 
where 6 is some constant whose value is not important to us. 

• Long Probes: In the case fi = 0, these probes correspond to 5 > 1. In the other case 
fi ^ 0, the probes correspond to the cases 6 > 1, 6 = 1, and (<5 < 1 , <5 + /x > 1). In 
all of these cases, we find: 

log dimjv ip ~ n N log — . (D.57) 
These results are in complete agreement with what we found in the general discussion in 



section- 2.2. 



E. Some useful properties of Kostka numbers 

In this appendix we will discuss some of the properties of Kostka numbers a that will 



be very useful in subsections 4.2 and 4.3. In the following the filling fj has no zero entries, 



see the end of subsection 4.1.2 for more details 



E.l Kostka numbers and fillings 

The first property we are going to discuss has to do with the behavior of the Kostka numbers 
under the reshuffling of the numbers /3j's that define a filling fj. The claim is that the Kostka 
numbers are invariant under that and hence we can take fj the ordered counterpart of fj 
as a representative of these fillings i.e. start from a filling fj = {fj\ , , ■ . . , fj m ), and 
construct the filling fj = (fj\ , 02 , ■ ■ ■ , fj m ) such that: 

h > h > ■ ■ ■ > Pm ■ 

To prove the claim above it is enough to prove that we can construct a one-to-one map 
between the SSYTx ip with fillings fj and fj that are related by exchanging two numbers 
fa and fj i+ i. i.e. 

P = (Pi ) Pi ■> ■ ■ ■ ■> Pi ) Pi+1 Pm) , P = (Pi , Pi i ■ ■ ■ i Pi+1 , Pi , ■ ■ ■ , Pro) ■ 

Let us assume that we constructed all the SSYTx a with filling fj and we want to construct 
the ones associated to the filling /3. This is done in three steps: 
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1. First, we relabel by (i + 1) the boxes labeled by i, and vice versa. Notice that since 
we are not touching the other labels in this step, we need only to deal with the order 
of the labels i and + 

2. Next, we first deal with the order of the labels i and (i + 1) in each column. If the 
label i sits below the label (i + 1) we exchange their position, otherwise we leave the 
column unchanged. Notice that in this step if the column in not left unchanged then 
all we have done is switching the labels of only two boxes in this column. 

3. Finally, we look at the order of labels in each row. Once again we need to reorder 
the boxes with labels i and (i + 1) only. This is done by putting the boxes with label 
% that are to the right of the boxes with labels (i + 1) to their left. At the end of this 
step we get a SSYT a with filling f3 for each SSYT a with filling fa 

Notice that we could have done the same by starting with the filling fa then construct the 
semi-standard Young tableaux associated to the filling fa Hence the two Kostka numbers 
associated to (3 and j3 are equal. 

Take for example the fillings ft = (1 , 1 , 2 , 1) and f3 = (1 , 2 , 1 , 1) and the YD: 



The only SSYTx associated to the filling f3 are: 



1 


2 


3 


3 


4 





1 


CO 


CO 


2 


4 







2 


4 


CO 


3 





Let us follow the steps above to construct the SSYTx associated to /3. The first step gives: 
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The only YT that is affected by the second step is the last one, so we have: 
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Finally, the last step remedies the first YT which is the only non-SSYT. We get at the end 
the following SSYTx associated to the filling j3 = (1 , 2 , 1 , 1): 



1 


2 


3 


2 


4 





1 


2 


2 


CO 


4 







2 


4 


2 


CO 





It is not hard to see that the SSYT above are the only ones possible for the filling fa In 
the remaining of this appendix whenever we talk about a filling f3, we assume that it is 
completely ordered i.e. fa > fa > ■ ■ ■■ 
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E.2 The non-zero Kostka numbers 



The question we want to address here is when is the Kostka number non vanishing? Since 
Kostka numbers i p are related to SSYTx ip which are themselves related to decomposi- 
tion of the tensor product ip®B, where B is some arbitrary YD 17 , through the ordering rule 
(see section |4.1.1| ) according to our discussion in section 4.1.2, we can use the implications 



of the ordering rule (see the end of section (4.1.l|) as a guiding tool. 

The first implication of the ordering rule is that a box in row i of the YD ip cannot 
be attached to a row j of the YD B Hi < j, see end of section |4.1.1| . This can be easily 
understood in the case of SSYT and has to do with the fact that the labels in the same 
column should be strictly increasing from top to bottom and labels in the same row should 
be weakly increasing from left to right. In the language of SSYT, i will be the row number 
and j will be the label. Due to the aforementioned conditions on the labels in a SSYT, the 
labels in row i should bigger or equal than i, which implies the absence of the label j in 
row i if j < i. 

The second implication of the ordering rule (see end of section |4.1.1|) can also be 
understood as a consequence of the conditions of the labels of SSYTx. In terms of the 
tensor product decomposition, it was shown that if the boxes added to a row i of B come 
from different rows of ip with m being the highest one among them, than the number of 
added boxes to the row i is less or equal than ip m the length of this row. In terms of the 
SSYT labeling, this situation corresponds to the presence of the label i in the row m of ip. 
Due to the conditions on the labeling of SSYT, any other box with label i should be to 
the left of this box even if it belongs to a the rows below row m. As a result the number 
of labels i in the SSYT ip is less or equal ip m the length of the row m of ip. 

The combination of the previous two conditions, together with the fact that the filling 
(3 is a totally ordered filling, implies the following condition for a non-vanishing Kostka 
number K^^p, see for example |33|]. The Kostka number is nonzero if and only 

if both ip and f3 are partitions of the same integer h and moreover ip is larger than /3 
in the dominance order. The latter is defined as follows. Let ip = {ip\ , ip2 , • • • , ipn), 
(3 = (/?i , , ■ ■ ■ , fim) with ipi > tp2 > ■ ■ ■ and /3i > > . . . be two ordered tuples of 
integers. We say that ip is larger than (5 in dominance order, and we write tp > j3, if and 
only if for each integer k > 1, the following is true: 

k k 

Vfc>l; 5>i>J>, 

i=l i=l 

where we fill in the non-existing integers ip^ for k > n by zeros. An immediate consequence 
of this is that the number of entries in /3 should be bigger or equal to the the number of 
rows of ip. Notice that this condition is only true for /3 being totally ordered. If (3 is not 
ordered than the statement above works in one directions only: K$ p nonzero then we 
have the dominance order condition but not the way around. 



7 We choose B such that the number of its columns is large enough. 
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E.3 Maximizing Kostka numbers 

The final point in our investigation on the Kostka numbers is to find conditions on the filling 
such that we get a maximum value for K a ^ p . This is important as we are interested in 
leading order behavior of the degeneracy of YD appearing in the decomposition of O (g) tft, 
and as we will see in subsection and section F.2 of appendix-^, this degeneracy is 



intimately connected to the maximum Kostka number in some cases of interest to us. 
Notice that we are fixing the shape ip here and varying 0. Two claims can be made here. 
First of all, \0\ the "length" of the filling should be maximized. By length we mean 
the number of non-zero entries 0i 7^ 0. Secondly, the entries 0i should be very close to 
the average h/\0o\ 18 • I n the following h will stand, as usual, for the number of boxes 
of the tableau ip whose shape is also denoted by ip, whereas N is, as usual, the flux of 
the background geometry which is the same N that appears in the group of our interest 
SU(AQ. 

The first claim is easy to understand. The condition on the columns ordering implies 
that we cannot have the same label on more than one box in the same column. Hence, to 
have more options in labeling we need to have more labels. As a result the bigger \0\ the 
more options we have. In the case N < h, the biggest \0\ possible is h which implies that 
0i = 1. So, the condition on \0\ is strong enough to imply the second claim above. Things 
are more complicated in the case N < h since here we have at least one 0i that is bigger 
than "1" for fillings associated with the largest possible value of \0\ = N. In this case the 
second claim adds a non trivial condition on the filling 0. The argument for the validity 
of the second claim in this case is as follows. 

First of all we are going to work with the ordered filling 0. We will also concentrate 
on the case N < h. Hence we are starting with the iV-tuple = {0\ , 02 , ■ ■ ■ , 0n) such 
that: 

N 



Let us first concentrate on 0\ and 02- Let us fix the sum 0\ + 02 but allow each of them to 
vary. Notice that the boxes with label 1 are confined to the first row whereas the ones with 
label 2 have the freedom to be located in either of the first tow rows. So, the bigger 02 the 
bigger the Kostka number. We reach the maximum if 02 ~ 0\ since 02 < 0\- Repeating 
the same argument for the labels i for 3 < i < n where we use that boxes with label j are 
confined to be in the first j rows, we conclude that to maximize the Kostka number we 
should choose: 

~0 X « 02 « . . . « n . 

To complete the argument we start from below. Once again fixing the sum 0n + 0n-i, 
but allowing both of them to vary. The position of the boxes with label iV is at the end of 
the rows with the condition that there are no boxes below them. For the boxes with label 



18 An intuitive argument to why such a choice is special is as follows. We argued before that the Kostka 
numbers are invariant under permutations of the entries of the filling j3. The fixed point of such a permu- 
tation is when all the entries are equal which reproduces precisely this choice. 
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(N — 1), they can be either on top of the boxes with label N, or to their left under the 
condition that if there are boxes below, they should carry the label N. So, if we have in 
a row k boxes with label N, we have at least k possible boxes to carry the label (N — 1). 
Hence the bigger /3jv> the more options we have which leads to the optimum situation 
Pn~i ~ An- We can repeat the same argument for the other labels and reach a similar 
conclusion. Hence the second claim. 

In the argument above, we cheated a little bit. Remember that there should be no 
boxes below the ones with label N. So the number of such boxes i.e. boxes with no box 
below in the YD, constitute an upper bound on /3n- So, the claim above should be modified 
accordingly. However, as we will see in subsection [D|, our case of interest corresponds to 
h/N is small enough, together with the type of YD we are dealing with put us in a safe 
position. 



F. From Kostka numbers to actual degeneracies dk 

In the following we will discuss the possible modifications to the conclusions derived in 



subsections 4.2 and 4.2 for SSYTx if) and their Kostka numbers once the YD and the 
antisymmetry rules are included to make contact with the tensor decomposition (|3.3|). 
Following the arguments for the Kostka numbers, the discussion depends on whether h <C 
N, or (h~N, h^>N). 

F.l The case h < N 



We arrived in subsection 4.2 at the conclusion that, without the inclusion of the YD and 
the antisymmetry rule, we can take the YDs ipl as representatives of the the YDs (ff. that 
appear in the decomposition of the tensor product O ® tp. Remember that the YDs ip^ 
are the result of adding at most one box to each row of O. Their degeneracy d® as well as 
their number Mo have the leading terms given in equation ( |4.8[) , which we rewrite here for 
convenience. 

h N 
log d k ~ h log — , log Mo « h log — . (F.l) 

ip h 

What happens when we take into account the YD and the antisymmetry rules? For con- 
creteness, let us restrict ourselves to the special case of the background YD Oq whose rows' 
length is given by: 

O l = N - * ; i = l,2, ... , N -1 . 

This corresponds to the limit shape YD of the superstar ensemble where the number of 
columns of its YDs is fixed to be N as well. We will discuss the possible modifications that 
we need to take into account for the general case at the end. 

Notice that for this background Oq neither the YD rule nor the antisymmetry rule 
modify the results above since both are trivially satisfied. Hence there are Mo YDs that 
are constructed by adding at most one box to each row, each of them has the degeneracy 
d?, which in total reproduces the leading behavior 19 of dhmy(/>. We will not discuss the 



This is after taking the log of the numbers under consideration. 
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other YDs <pk since they will not play a role in what we are trying to do, see subsection 
[5Xg for more details. 

To close the line of thoughts, we need to discuss what happens in the case of a different 
background O. We need to distinguish between a YD O in the same ensemble as Oq, and 
YD in a different ensemble. The worst scenario in both cases amounts to replacing N by 
kN, with k < 1. The idea is to replace every set of consecutive equal length rows with just 
one row, which brings our discussion close enough 20 to the one for Oq- The only thing we 
need to make sure about is that after doing so, we are still left with order N rows. First, 
let us deal with a YD O that belongs to the same ensemble as Oq. In this case we know 
that the fluctuation in the row length is 5 Oi ~ y/~N JIJ (see also subsection |2.1| ). Hence, 
the number of consecutive rows of equal length can be at most of order yN. So, in general 
we can have N a , a < 1/2 consecutive rows of equal length. For these set of rows, the 
number of moved boxes is of order N 2a /2. Since the total number of moved boxes in O 



when compared with 0$ is of order TV [[jj (see also subsection 2A ) , the number of such sets 
should grow slower than iV 1-a . Hence once we collapse these sets to just one row we will 
end up with a YD whose number of rows is of order N. Applying the same considerations 
as in the case of Co, we conclude that the leading behavior of both the number of different 
YD in the tensor product decomposition as well as their degeneracy does not change, since 
the only thing one needs to do is to replace N there by k N inside the log. 

What about a YD from another ensemble? We start with its limit shape YD. We can 
have cases where this limit shape YD has sets of order iV° of equal length, for example 
N = 2D, where N is the number of rows whereas D is the number of columns. In these 
cases we should collapse these sets to just one row. In the opposite cases, we leave the limit 
shape YD untouched. All in all we end up with a YD where the number of its rows is of 
order N, and where the shift of length between two consecutive rows is at worst one box. 
As a result, we can borrow the discussion above to conclude that the leading behavior of 
the number of the different YDs in the tensor product decomposition as well as the leading 
behavior of their degeneracy remains intact. For the other YDs in this ensemble, we can 
easily adapt the discussion of the YD in the Oq ensemble to this case, reaching the same 
conclusion. 

All in all, in the tensor product decomposition of 0®ip, where if) is a YD with number 
of boxes h <C N, the number of different YDs and their degeneracy has the following 
leading behavior: 

h N 
log d k » h log — , logM « h log — , (F.2) 

where, according to our conventions (appendix-|A|), V'o = Max{n, d}, d is the number of 
columns of i/j, and n is the number of its rows. 

F.2 The cases h ~ N and h » N 



We arrived in subsection 4.3 to the conclusion that, in the case where i/j is a generic probe 



and h ~ ./V or h 3> N, and forgetting about the antisymmetry rule, there are special YDs 



20 In general, the shift in the length of consecutive rows can be more than one box which does not change 
the essence of the discussion in the case of Co- 
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tPk in the tensor product decomposition (|3.3| ) whose degeneracy d* k has a leading behavior 
given by equation ( [4.10D , which we rewrite here for convenience: 



log d* k « dirn/v ^ ~ h log — . (F.3) 

These YDs are constructed by adding hi ~ (h/N) boxes to each row of the YD O. Notice 
that hi <C d and hi <C n since n, d <C iV and /i ~ nrf. In this appendix, we want to study 
the implication of taking into account the antisymmetry rule. To fix notations below we 
will denote by f3o the filling of the SSYT ip* associated to one of the YDs ip^. Hence: 

Po = (hi , h 2 , ■ ■ ■ , h N ) , hi& , hi > h 2 > . . . > h N , 

where hi is as usual the degeneracy of the label i. We have the following leading behavior 
of the associated Kostka number: 

log K a> p « dimjv ip , 

We further concentrate on the simple case where the background YD Oq whose rows' length 
is given by: 

d = N -i; i = l,2, ... , N -1 . 

This the same background YD that we used in the explicit discussion in the previous 
section. We will come back to the possible modifications to the arguments below at the 
end of this section. 

What does the antisymmetry rule imply? First of all, the labels associated to the first 
raw should be different for each box except for the first few ones that carry the label 1. 
So, for i > 1 we cannot find more than one box in the first row with this label. The same 
argument leads to the conclusion that in row k, there should be at most one box with label 
i > k. So essentially, taking into account the antisymmetry rule forces the labeling of the 
YD ip* to be of the following form. The label 1 < i < n is mostly confined to the row i. In 
other words there will be at most one box with label j in a row i ^ j. The remaining labels 
i > n will appear at most once in each row. To proceed further, we write schematically 
the different contributions to the associated Kostka number as: 

^,/3o=< ) /3o +E< ) /3o' ( F - 4 ) 
a 

where o stands for the contribution giving rise to the degeneracy we are after i.e. the 
labelings that satisfies the required conditions discussed above, and the sum is over the 
remaining "bad" contributions. We will call the former the good labelings and the latter 
the bad labelings. The meaning of the index a in the sum will be clear below, but at the 
moment it stands for some indexing of the bad labelings. Our aim in the following is to 
relate somehow the bad labelings to the good ones. But before that, let us first deal with 
the special labels {1 , 2 , ... , n}. Remember that these labels are special in the sense that 
they are the only ones that are allowed to occur more than once given that they are in the 
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row with the same number as the label. In other words, we can have more than one box 
with label i if and only if 1 < i < n and the associated boxes belong to row i in the tableau 
tft. Let us count the number of labelings of ip that violate this requirement. It is easy to 
see that this number is smaller than: 

n 

log n wrong = log ~ (-^) h log N < h log N . 

i=i 

Given that log K aj ^ ~ h logiV, we can safely restrict ourselves to counting the number 
of labelings where the first n labels are fixed as above. Said differently, we will try in the 
following to estimate the number of labeling K a p such that the labels 1 < i < n are 
mainly confined to be each in its corresponding row i. Our discussion below will concern 
only the labels n < i < N . 

A useful way to think about counting the labelings K a g is as follows. First construct 
all the labelings where each row of tp has at most one box with label i, for all the labels 
n < i < N. These labelings were called good labelings and their number is the one 

we are after. For each labelings among these, we will move the boxes around in order to 
increase the degeneracy of labels in the rows of tp. For each good labeling a, the number 
of bad labelings constructed so K^ a K is smaller than: 

N 
k=l 

where we used that Ylk = h. It is easy to convince yourself that in this way, we will 
be able to construct all the labeling of ip associated to /3q. The reason is once again the 
inequality hi <C n. This is because, starting from a bad labeling, one can find a lot of 
possible rows to accommodate the degenerate labels in order to construct at the end a 
good labeling of ip. 

Now identifying the label a in the sum ( |F.4| ) with the index of good labels, together 
with the identification K^Bo = ^a\ ana - ^ s u PP er bound above, we conclude that to 
leading order: 

log K^lfa Kt log K a t ft) w log dim^ rp ~ h log N . (F.5) 

This result looks universal despite the fact that we started with a specific background 
tableau Oq. Actually one can easily generalize the arguments above to other backgrounds, 
either belonging to the same ensemble as Co, or other ensemble. The key idea is the 
observation mentioned above regarding the scale of hi. Remember we have hi <C n, d. Let 
us discuss each case on its own. 

For other backgrounds in the ensemble of Oq, the complication arises because in this 
case we do not have the simple picture of a jump by one box between consecutive rows as 
in Oq. We could either have some rows with the same length, or the difference in length of 
some consecutive rows will be larger than one box. The first situation in principle reduces 
the number of possible YDs in the tensor decomposition, however we can easily see that 
the leading order does not change. An easy way to see this is to follow the steps bellow to 
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construct YDs in the decomposition of the tensor product O®^, where O is some YD in 
the same ensemble of Oq. 

• The effect of having sets of rows with equal length has at worst the effect of reducing 
./V inside the log to a fraction of it, which does not modify the leading order (see 
the end of the previous section). A way to think about this is to imagine that we 
replaced our background Young tableau O with a new one O by replacing all the sets 
of rows of equal length by a single row. By doing so we reduce the range of possible 
labels to just a fraction of N. But since we are not altering the inequalities hi <C n, 
d, we can still use our original arguments used in the case of Oq. 

• After deflating O to O, we first construct all the good labelings associated to the 
latter YD. After that, we inflate back O to the original YD O. At the end of this 
operation we will end up with diagrams that are not YDs. Essentially, the rows right 
after each set of equal length rows of O might be bigger than the one above it. But 
we can redistribute the extra boxes on the empty rows since they come from different 
rows in tp, and hence they do not violate the antisymmetry rule. The ordering rule 
is also preserved because of the way we construct the tensor product if we move the 
boxes in the following way. We just cut the excess boxes and slide them up without 
changing their order. This is explained in the following example: 
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As a result, we do not change the leading order given in equation 



The second difference, the difference in length of consecutive rows is bigger than one box, 
is beneficial as it allows to have more options because now we can allow for more than one 
box with the same label i > n, for some i's. But since already the leading behavior of the 



degeneracy is saturated by ([F.4D , we conclude once again that the leading term remains 
intact. 

All in all, dealing with a different background YD in the same ensemble of Oq leads 
to the same leading term of the largest degeneracy of YDs in the decomposition of O ® tp 



as in equation (F.4). 

In the case of other ensembles, the same story goes through as the only change in 
this case can be reabsorbed by replacing N by a fraction of it in the worst case. This is 
because the background YDs we are dealing with are the ones where the number of rows 
is iV and the number of columns N c is of order N, and the number of boxes in fixed to be 
(N c N r )/2. 
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